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TITLE 

GENES ENCODING BAEYER-VILLIGER MONOOXYGENASES 
FIELD OF THE INVENTION 
The invention relates to the field of molecular biology and 
5 microbiology. More specifically, genes have been isolated from a variety 
of bacteria encoding Baeyer-Villiger monooxygenase activity. 

BACKGROUND OF THE INVENTION 
In 1899, Baeyer and Villiger reported on a reaction of cyclic ketones 
with peroxymonosulfuric acid to produce lactones (Chem Ber 
10 32:3625-3633 (1 899)). Since then, the Baeyer-Villiger (BV) reaction has 
been broadly used in organic synthesis. BV reactions are one of only a 
few methods available for cleaving specific carbon-carbon bonds under 
mild conditions, thereby converting ketones into esters (Walsh and Chen, 
Angew.Chem.lnt.Ed. Engl 27:333-343 (1988)). 
15 In the last several decades, the importance of minimizing 

environmental impact in industrial processes has catalyzed a trend 
whereby alternative methods are replacing established chemical 
techniques. In the arena of Baeyer-Villiger (BV) oxidations, considerable 
interest has focused on discovery of enantioselective versions of the 
20 Baeyer-Villiger oxidation that are not based on peracids. Enzymes, which 
are often enantioselective, are valued alternatives as renewable, 
biodegradable resources. 

Many microbial Baeyer-Villiger monooxygenases enzymes (BVMOs 
) , which convert ketones to esters or the corresponding lactones (cyclic 
25 esters) (Stewart, Curr. Org. Chem. 2:195-216 (1998), have been identified 
from both bacterial and fungal sources. In general, microbial BV 
reactions are carried out by monooxygenases (EC 1.14.13.x) which use 
0 2 and either NADH or NADPH as a co-reductant. One of the oxygen 
atoms is incorporated into the lactone product between the carbonyl 
30 carbon and the flanking carbon while the other is used to oxidize the 
reduced NADPH producing H 2 0 (Banerjee, A. In Stereosel, Biocatal.; 
Patel, R.N., Ed.; Marcel Dekker: New York, 2000; Chapter 29, 
pp 867-876). All known BVMOs have a flavin coenzyme which acts in the 
oxidation reaction; the predominant coenzyme form is flavin adenine 
35 dinucleotide cofactor (FAD). . 

The natural physiological role of most characterized BVMOs is 
degradation of compounds to permit utilization of smaller hydrocarbons 
and/or alcohols as sources of carbon and energy. As a result of this, 
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BVMOs display remarkably broad substrate acceptance, high 
enantioselectivies, and great stereoselctivity and regie-selectivity 
(Mihovilovic et al. J. Org. Chem. 66:733-738 (2001). Suitable substrates 
for the enzymes can be broadly classified as cyclic ketones, ketoterpenes, 

5 and steroids. However, few enzymes have been subjected to extensive 
biochemical characterization. Key studies in relation to each broad ketone 
substrate class are summarized below. 

1 . Cyclic ketones: Activity of cyclohexanone monooxygenase 
upon cyclic ketone substrates in Acinetobacter sp. NCIB 9871 has been 

10 studied extensively (reviewed in Stewart, Curr. Org. Chem. 2:195-216 
(1998), Table 2; Walsh and Chen, Angew.Chem.lnt.Ed. Engl 27:333-343 
(1988), Tables 4-5) . Specificity has also been biochemically analyzed in 
Brevibacterium sp. HCU (Brzostowicz et al., J. Bad. 1 82(1 5):4241 -4248 
(2000)). 

15 2. Ketoterpenes: A monocyclic monoterpene ketone 

monooxygenase has been characterized from Rhodococcus erythropolis 
DCL14 (Van der Werf, J. Biochem. 347:693-701 (2000)). In addition to 
broad substrate specificity against ketoterpenes, the enzyme also has 
activity against substituted cyclohexanones. . 

20 3. Steroids: The steroid monooxygenase of Rhodococcus 

rhodochrous (Morii et al. J. Biochem 126:624-631 (1999)) is well 
characterized, both biochemically and by sequence data. 

The genes and gene products listed above are useful for specific 
Baeyer-Villiger reactions targeted toward cyclic ketone, ketoterpene, or 

25 steroid compounds, however the enzymes are limited in their ability to 
predict other newly discovered proteins which would have similar activity. 

The problem to be solved, therefore is to provide a suite of bacterial 
flavoprotein Baeyer-Villiger monooxygenase enzymes that can efficiently 
perform oxygenation reactions on cyclic ketones and ketoterpenes 

30 compounds. Identity of a suite of enzymes with this broad substrate 
acceptance would facilitate commercial applications of these enzymes 
and reduce efforts with respect to optimization of multiple enzymes for 
multiple reactions. Maximum efficiency is especially relevant today, when 
many enzymes are genetically engineered such that the enzyme is 

35 recombinantly expressed in a desirable host organism. Additionally, a 
collection of BVMO's with diverse amino acid sequences could be used to 
create a general predictive model based on amino acid sequence 
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conservation of other BVMO enzymes. Finally, a broad class of BVMO's 
could also be used as basis for the in vitro evolution of novel enzymes. 

Applicants have solved the stated problem by isolating several 
novel organisms with BVMO activity, identifying and characterizing BMVO 
5 genes, expressing these genes in microbial hosts, and demonstrating 
activity of the genes against a wide range of ketone substrates, including 
cyclic ketones and ketoterpenes. Several signature sequences have been 
identified, based on amino acid sequence alignments, which are 
characteristic of specific BVMO families and have diagnostic utility. 
10 SUMMARY OF THE INVENTION 

The invention provides an isolated nucleic acid fragment isolated 
from Rhodococcus selected from the group consisting of: 

(a) an isolated nucleic acid fragment encoding a Baeyer-Villiger 
monooxygenase polypeptide having an amino acid sequence selected 

15 from the group consisting of SEQ ID NOs:8, 10, 22, 24, 26, 28, 30, 32, 34, 
36, 38, 40, 42, 44, and 46. 

(b) an isolated nucleic acid molecule encoding a Baeyer-Villiger 
monooxygenase polypeptide that hybridizes with (a) under the following 
hybridization conditions: 0.1X SSC, 0.1% SDS, 65°C and washed with 2X 

20 SSC, 0.1 % SDS followed by 0.1X SSC, 0.1% SDS; or 

an isolated nucleic acid fragment that is complementary to (a) or 

(b). 

Similarly the invention provides an isolated nucleic acid fragment 
isolated from Arthrobacter selected from the group consisting of: 
25 (a) an isolated nucleic acid fragment encoding a Baeyer-Villiger 

monooxygenase polypeptide having an amino acid sequence as set forth 
in SEQ ID NO:12; 

(b) an isolated nucleic acid molecule encoding a Baeyer-Villiger 
monooxygenase polypeptide that hybridizes with (a) under the following 
30 hybridization conditions: 0.1X SSC, 0.1 % SDS, 65°C and washed with 2X 
SSC, 0.1% SDS followed by 0.1X SSC, 0.1% SDS; or 

an isolated nucleic acid fragment that is complementary to (a), or 

(b). 

Additionally the invention provides an isolated nucleic acid 
35 fragment isolated from Acidovorax selected from the group consisting of: 

(a) an isolated nucleic acid fragment encoding a Baeyer-Villiger 
monooxygenase polypeptide having an amino acid sequence as set forth 
in SEQIDNO:18 
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(b) an isolated nucleic acid molecule encoding a Baeyer-Villiger 
monooxygenase polypeptide that hybridizes with (a) under the following 
hybridization conditions: 0.1X SSC, 0.1% SDS, 65°C and washed with 2X 
SSC, 0.1 % SDS followed by 0.1X SSC, 0.1 % SDS; or 
5 an isolated nucleic acid fragment that is complementary to (a), or 

(b). 

In additional embodiments the invention provides polypeptides 
encoded by the present sequences as well as genetic chimera of the 
present sequences and transformed hosts expressing the same. 
10 In a preferred embodiment the invention provides a method for the 

identification of a polypeptide having monooxygenase activity comprising: 

(a) obtaining the amino acid sequence of a polypeptide suspected 
of having monooxygenase activity; and 

(b) aligning the amino acid sequence of step (a) with the amino acid 
15 sequence of a Baeyer-Villiger monooxygenase consensus sequence 

selected from the group consisting of SEQ ID NO:47, SEQ ID NO:48 and 
SEQ ID NO:49, 

wherein where at least 80% of the amino acid residues at positions 
p1- p74 of SEQ ID NO:47, or at least 80% of the amino acid residues at 
20 p1-p76 of SEQ ID NO:48 or at least 80% of the amino acid residues of p1- 
p41 of SEQ ID NO:49 are completely conserved, the polypeptide of (a) is 
identified as having monooxygenase activity. 

In an alternate embodiment the invention provides a method for 
identifying a gene encoding a Baeyer-Villiger monooxygenase polypeptide 
25 comprising: 

(a) probing a genomic library with a nucleic acid fragment encoding 
a polypeptide wherein where at least 80% of the amino acid residues at 
positions p1- p74 of SEQ ID NO:47, or at least 80% of the amino acid 
residues at p1-p76 of SEQ ID NO:48 or at least 80% of the amino acid 

30 residues of p1-p41 of SEQ ID NO:49 are completely conserved; 

(b) identifying a DNA clone that hybridizes with a nucleic acid 
fragment of step (a); 

(c) sequencing the genomic fragment that comprises the clone 
identified in step (b), 

35 wherein the sequenced genomic fragment encodes a Baeyer- 

Villiger monooxygenase polypeptide. 

In a preferred embodiment the invention provides a method for the 
biotransformation of a ketone substrate to the corresponding ester, 



4 



WO 03/020890 



PCTAJS02/27549 



comprising: contacting a transformed host cell under suitable growth 
conditions with an effective amount of ketone substrate whereby the 
corresponding ester is produced, said transformed host cell comprising a 
nucleic acid fragment encoding an isolated nucleic acid fragment of any of 
5 the present nucleic acid sequences; under the control of suitable 
regulatory sequences. 

In an alternate embodiment the invention provides a method for the 
in vitro transformation of a ketone substrate to the corresponding ester, 
comprising: contacting a ketone substrate under suitable reaction 

10 conditions with an effective amount of a Baeyer-Villiger monooxygenase 
enzyme, the enzyme having an amino acid seqeunce selected from the 
group consisting of SEQ ID NOs:8, 10, 22, 24, 26, 28, 30, 32, 34, 36, 38, 
40, 42, 44, and 46. 

Additionally the invention provides a mutated microbial gene 

15 encoding a protein having an altered biological activity produced by a 
method comprising the steps of: 

(i) digesting a mixture of nucleotide sequences with restriction 
endonucleases wherein said mixture comprises: 

a) a native microbial gene selected from the group consisting of 
20 SEQIDNOs:7,9, 11,13, 15, 17,19,21,23,25,27,29,31,33, 

35, 37, 39, 41, 43, and 45; 

b) a first population of nucleotide fragments which will hybridize 
to said native microbial sequence; 

c) a second population of nucleotide fragments which will not 
25 hybridize to said native microbial sequence; 

wherein a mixture of restriction fragments are produced; 

(ii) denaturing said mixture of restriction fragments; 

(iii) incubating the denatured said mixture of restriction fragments 
of step (ii) with a polymerase; 

30 (iv) repeating steps (ii) and (iii) wherein a mutated microbial gene is 

produced encoding a protein having an altered biological activity. 
Additionally the invention provides unique strains of Acidovorax sp. 
comprising the 16s rDNA sequence as set forth in SEQ ID NO:5, 
Arthrobacter sp. comprising the 16s rDNA sequence as set forth in 

35 SEQ ID NO:1 , and Rhodococcus sp. comprising the 1 6s rDNA 

sequence as set forth in SEQ ID NO:6. 
In another embodiment the invention provides an Acidovorax sp. 
comprising the 16s rDNA sequence as set forth in SEQ ID NO:5. 
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Additionally the invention provides an Arthrobacter sp. comprising the 16s 
rDNA sequence as set forth in SEQ ID NO:1 . Similarly the invention 
provides a Rhodococcus sp. comprising the 16s rDNA sequence as set 
forth in SEQ ID NO:6. 
5 Additionally the invention provides an isolated nucleic acid useful 

for the identification of a BV monooxygenase selected from the group 
consisting of SEQ ID 70-1 13. 

BRIEF DESCRIPTION OF THE DRAWINGS. 
AND SEQUENCE DESCRIPTIONS 
10 Figures 1 , 2, 3, 4, and 5 show chnB monooxygenase activity of 

Brevibacterium sp. HCU, >4c//?efobacter SE19, Rhodococcus sp. phil , 
Rhodococcus sp. phi2, Arthrobacter sp. BP2 and Acidovorax sp. CHX 
genes over-expressed in E. colt assayed against various ketone 
substrates. 

15 Figure 6 illustrates the signature sequences of the three BVMO 

groups based on the consensus sequences derived from the alignments 
of Figure 7, Figure 8 and Figure 9. 

Figure 7 shows a Clustal W alignment of a family of Baeyer-Villiger 
monoxygenases (Family 1) and the associated signature sequence. 
20 Figure 8 shows a Clustal W alignment of a family of Baeyer-Villiger 

monoxygenases (Family 2) and the associated signature sequence. 

Figure 9 shows a Clustal W alignment of a family of BC 
monoxygenases (Family 3) and the associated signature sequence. 
The invention can be more fully understood from the following 
25 detailed description and the accompanying sequence descriptions which 
form a part of this application. 

The following sequences conform with 37 C.F.R. 1 .821-1 .825 
("Requirements for Patent Applications Containing Nucleotide Sequences 
and/or Amino Acid Sequence Disclosures - the Sequence Rules") and 
30 consistent with World Intellectual Property Organization (WIPO) Standard 
ST.25 (1998) and the sequence listing requirements of the EPO and PCT 
(Rules 5.2 and 49.5(a-bis), and Section 208 and Annex C of the 
Administrative Instructions). The symbols and format used for nucleotide 
and amino acid sequence data comply with the rules set forth in 
35 37 C.F.R. §1.822. 

SEQ ID NOs:1-49 are full length genes or proteins as identified in 
Table 1. 

Table 1 
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Gene Name 


Organism 


Gene 
SEQ ID 
No 


Protein 
SEQ ID 
No 


16s rDNA sequence 


Arthmbacter sp. BP2 


1 




16s rDNA sequence 


Rhodococcus sp. phil 


2 


__ 


16s rDNA sequence 


Rhodococcus sp. phi2 


3 


_ 


16s rDNA sequence 


Brevibacterium sp. HCU 


4 




16s rDNA sequence 


Acidovorax sp. CHX 


5 




16s rDNA sequence 


Rhodococcus 
erythropolis AN12 


6 




chnB Monooxygenase 
phil 


Rhodococcus sp. phil 


7 


8 


chnB Monooxygenase 
phi2 


Rhodococcus sp. phi2 


9 


10 


chnB Monooxygenase 
BP2 


Arthmbacter sp. BP2 


11 


12 


chnB1 Monooxygenase 
HCU #1 


Brevibacten'um sp. HCU 


13 


14 


chnB2 Monooxygenase 
HCU #2 


Brevibacterium sp. HCU 


15 


16 


chnB Monooxygenase 
CHX 


Acidovorax sp. CHX 


17 


18 


chnB Monooxygenase 
SE19 


Acinetobacter sp. SEW 


19 


20 


ORF 8 chnB 
Monooxygenase (1413) 


Rhodococcus 
erythropolis AN1 2 


21 


22 


ORF 9 chnB 
Monooxygenase (1985) 


Rhodococcus 
erythropolis AN 12 


23 


24 


ORF 10 cfcnB 
Monooxygenase (1273) 


Rhodococcus 
erythropolis AN 12 


25 


26 


ORF 11 chnB 
Monooxygenase (2034) 


Rhodococcus 
erythropolis AN 12 


27 


28 


ORF \2chnB 
Monooxygenase (1870) 


Rhodococcus 
erythropolis AN 12 


29 


30 


ORF \ZchnB 
Monooxygenase (1861) 


Rhodococcus 
erythropolis AN12 


31 


32 


ORF 14 chnB 


Rhodococcus 


33 


34 
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Gene Name 


Organism 


Gene 
SEQ ID 
No 


Protein 
SEQ ID 
No 


Monooxygenase (2005) 


erythropolis AN12 






ORF 15 c/wB 
Monooxygenase (2035) 


Rhodococcus 
erythropolis AN12 


35 


36 


ORF 16 chnB 
Monooxygenase (2022) 


Rhodococcus 
erythropolis AN12 


37 


38 


ORF 17 chnB 
Monooxygenase (1976) 


Rhodococcus 
erythropolis AN12 


39 


40 


ORF18crm8 
Monooxygenase (1294) 


Rhodococcus 
erythropolis AN12 


41 


42 


ORF 19 chnB 
Monooxygenase (2082) 


Rhodococcus 
erythropolis AN12 


43 


44 


ORF 20 chnB 
Monooxygenase (2093) 


Rhodococcus 
erythropolis AN12 


45 


46 


Signature Sequence #1 


Consensus Sequence 




47 


Signature Sequence #2 


Consensus Sequence 




48 


Siqnature Sequence #3 


Consensus Sequence 




49 



SEQ ID NOs:50-62 are primers used for 16s rDNA sequencing. 
SEQ ID NO:63 describes a primer used for RT-PCR and out-PCR. 
SEQ ID NOs:64 and 65 are primers used for sequencing of inserts 
5 within pCR2.1 



SEQ ID NOs:66 and 67 are primers used to amplify monooxygenase 
genes from Acinetobacter sp. SE19. 

SEQ ID NOs:68-107 are primers used for amplification of full length 
10 Baeyer-Villiger monooxygenases. 

SEQ ID NOs:108-113 are primers used to screen cosmid libraries. 
DETAILED DESCRIPTION OF THE INVENTION 
The invention provides nucleic acid and amino acid sequences 
defining a group of Baeyer-Villiger monooxygenase enzymes. These 
15 enzymes have been found to have the ability to use a wide variety of 
ketone substrates that include two general classes of compounds, cyclic 
ketones and ketoterpenes. These enzymes are characterized by function 
as well as a series of diagnostic signature sequences. The enzymes may 
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be expressed recombinantly for the conversion of ketone substrates to the 
corresponding lactones or esters. 

In this disclosure, a number of terms and abbreviations are used. 
The following definitions are provided. 
5 "Open reading frame" is abbreviated ORF. 

"Polymerase chain reaction" is abbreviated PCR. 

"Gas Chromatography Mass spectrometry" is abbreviated GC-MS. 

"Baeyer-Villiger" is abbreviated BV. 

"Baeyer-Villiger monooxygenase" is abbreviated BVMO. 
10 The term "Baeyer-Villiger monooxygenase", refers to a bacterial 

enzyme that has the ability to oxidize a ketone substrate to the 
corresponding lactone or ester. 

The term "ketone substrate" includes a substrate for a Baeyer- 
Villiger monooxygenase that comprises a class of compounds which 
15 include cyclic ketones and ketoterpenes. Ketone substrates of the 
invention are defined by the general formula: 



20 wherein R and R-) are independently selected from substituted or 

unsubstituted phenyl, substituted or unsubstituted alkyl, substituted or 
unsubstituted alkenyl, or substituted or unsubstituted alkylidene. 

The term "alkyl" will mean a univalent group derived from alkanes 
by removal of a hydrogen atom from any carbon atom: C n H 2n+1 -. The 

25 groups derived by removal of a hydrogen atom from a terminal carbon 
atom of unbranched alkanes form a subclass of normal alkyl (n-alkyl) 
groups: H[CH 2 ] n -. The groups RCH 2 -, R 2 CH- (R not equal to H), and 
R 3 C- (R not equal to H) are primary, secondary and tertiary alkyl groups 
respectively. 

30 The term "alkenyl" will mean an acyclic branched or unbranched 

hydrocarbon having one carbon-carbon double bond and the general 
formula C n H 2n . Acyclic branched or unbranched hydrocarbons having 
more than one double bond are alkadienes, alkatrienes, etc. 

The term "alkylidene" will mean the divalent groups formed from 

35 alkanes by removal of two hydrogen atoms from the same carbon atom, 
the free valiances of which are part of a double bond (e.g. (CH 3 ) 2 C, also 
known as propan-2-ylidene). 
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As used herein, an "isolated nucleic acid molecule" is a polymer of 
RNA or DNA that is single- or double-stranded, optionally containing 
synthetic, non-natural or altered nucleotide bases. An isolated nucleic 
acid fragment in the form of a polymer of DNA may be comprised of one 

5 or more segments of cDNA, genomic DNA or synthetic DNA. 

A nucleic acid molecule is "hybridizable" to another nucleic acid 
molecule, such as a cDNA, genomic DNA, or RNA, when a single 
stranded form of the nucleic acid molecule can anneal to the other nucleic 
acid molecule under the appropriate conditions of temperature and 

10 solution ionic strength. Hybridization and washing conditions are well 
known and exemplified in Sambrook, J., Fritsch, E. F. and Maniatis, T. 
Molecular Cloning: A Laboratory Manual . Second Edition, Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor (1989), particularly 
Chapter 1 1 and Table 11.1 therein (entirely incorporated herein by 

15 reference). The conditions of temperature and ionic strength determine 
the "stringency" of the hybridization. Stringency conditions can be 
adjusted to screen for moderately similar fragments, such as homologous 
sequences from distantly related organisms, to highly similar fragments, 
such as genes that duplicate functional enzymes from closely related 

20 organisms. Typical stringent hybridization conditions are for example, 
hybridization at 0.1 X SSC, 0.1% SDS, 65°C with a wash with 2X SSC, 
0.1% SDS followed by 0.1X SSC, 0.1% SDS. Generally post-hybridization 
washes determine stringency conditions. One set of preferred conditions 
uses a series of washes starting with 6X SSC, 0.5% SDS at room 

25 temperature for 1 5 min, then repeated with 2X SSC, 0.5% SDS at 45°C 
for 30 min, and then repeated twice with 0.2X SSC, 0.5% SDS at 50°C for 
30 min. A more preferred set of stringent conditions uses higher 
temperatures in which the washes are identical to those above except for 
the temperature of the final two 30 min washes in 0.2X SSC, 0.5% SDS 

30 was increased to 60°C. Another preferred set of highly stringent 
conditions uses two final washes in 0.1 X SSC, 0.1% SDS at 65°C. 
Hybridization requires that the two nucleic acids contain complementary 
sequences, although depending on the stringency of the hybridization, 
mismatches between bases are possible. The appropriate stringency for 

35 hybridizing nucleic acids depends on the length of the nucleic acids and 
the degree of complementation, variables well known in the art. The 
greater the degree of similarity or homology between two nucleotide 
sequences, the greater the value of Tm for hybrids of nucleic acids having 
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those sequences. The relative stability (corresponding to higher Tm) of 
nucleic acid hybridizations decreases in the following order: RNA:RNA, 
DNA:RNA, DNA:DNA. For hybrids of greater than 100 nucleotides in 
length, equations for calculating Tm have been derived (see Sambrook et 
5 a/., supra, 9.50-9.51 ). For hybridizations with shorter nucleic acids, i.e., 
oligonucleotides, the position of mismatches becomes more important, 
and the length of the oligonucleotide determines its specificity (see 
Sambrook et a/., supra, 1 1 .7-1 1 .8). In one embodiment the length for a , 
hybridizable nucleic acid is at least about 10 nucleotides. Preferable a 

10 minimum length for a hybridizable nucleic acid is at least about 

15 nucleotides; more preferably at least about 20 nucleotides; and most 
preferably the length is at least 30 nucleotides. Furthermore, the skilled 
artisan will recognize that the temperature and wash solution salt 
concentration may be adjusted as necessary according to factors such as 

15 length of the probe. 

The term "complementary" is used to describe the relationship 
between nucleotide bases that are capable to hybridizing to one another. 
For example, with respect to DNA, adenosine is complementary to 
thymine and cytosine is complementary to guanine. Accordingly, the 

20 instant invention also includes isolated nucleic acid fragments that are 
complementary to the complete sequences as reported in the 
accompanying Sequence Listing as well as those substantially similar 
nucleic acid sequences. 

The term "percent identity", as known in the art, is a relationship 

25 between two or more polypeptide sequences or two or more 

polynucleotide sequences, as determined by comparing the sequences. 
In the art, "identity" also means the degree of sequence relatedness 
between polypeptide or polynucleotide sequences, as the case may be, as 
determined by the match between strings of such sequences. "Identity" 

30 and "similarity" can be readily calculated by known methods, including but 
not limited to those described in: Computational Molecular Biology (Lesk, 
A. M., ed.) Oxford University Press, New York (1988); Biocomputinq: 
Informatics and Genome Projects (Smith, D. W., ed.) Academic Press, 
New York (1993); Computer Analysis of Sequence Data. Part I (Griffin, A. 

35 M., and Griffin, H. G., eds.) Humana Press, New Jersey (1 994); Sequence 
Analysis in Molecular Biology (von Heinje, G., ed.) Academic Press 
(1987); and Sequence Analysis Primer (Gribskov, M. and Devereux, J., 
eds.) Stockton Press, New York (1991 ). Preferred methods to determine 
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identity are designed to give the best match between the sequences 
tested. Methods to determine identity and similarity are codified in publicly 
available computer programs. Sequence alignments and percent identity 
calculations may be performed using the Megalign program of the 
5 LASERGENE bioinformatics computing suite (DNASTAR Inc., Madison, 
Wl). Multiple alignment of the sequences was performed using the Clustal 
method of alignment (Higgins and Sharp (1989) CABIOS. 5:151-153) with 
the default parameters (GAP PENALTY=10, GAP LENGTH 
PENALTY=10). Default parameters for pairwise alignments using the 

10 Clustal method were KTUPLE 1 , GAP PENALTY=3, WINDOW=5 and 
DIAGONALS SAVED=5. 

Suitable nucleic acid fragments (isolated polynucleotides of the 
present invention) encode polypeptides that are at least about 70% 
identical, preferably at least about 80% identical to the amino acid 

15 sequences reported herein. Preferred nucleic acid fragments encode 
amino acid sequences that are about 85% identical to the amino acid 
sequences reported herein. More preferred nucleic acid fragments 
encode amino acid sequences that are at least about 90% identical to the 
amino acid sequences reported herein. .Most preferred are nucleic acid 

20 fragments that encode amino acid sequences that are at least about 95% 
identical to the amino acid sequences reported herein. Suitable nucleic 
acid fragments not only have the above homologies but typically encode a 
polypeptide having at least 50 amino acids, preferably at least 100 amino 
acids, more preferably at least 150 amino acids, still more preferably at 

25 least 200 amino acids, and most preferably at least 250 amino acids. 

"Codon degeneracy" refers to the nature in the genetic code 
permitting variation of the nucleotide sequence without effecting the amino 
acid sequence of an encoded polypeptide. Accordingly, the instant 
invention relates to any nucleic acid fragment that encodes all or a 

30 substantial portion of the amino acid sequence encoding the instant 
microbial polypeptides as set forth in SEQ ID NOs:8, 10, 12, 14, 16, 18, 
20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, and 46. The skilled 
artisan is well aware of the "codon-bias" exhibited by a specific host cell in 
usage of nucleotide codons to specify a given amino acid. Therefore, 

35 when synthesizing a gene for improved expression in a host cell, it is 
desirable to design the gene such that its frequency of codon usage 
approaches the frequency of preferred codon usage of the host cell. 
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"Synthetic genes" can be assembled from oligonucleotide building 
blocks that are chemically synthesized using procedures known to those 
skilled in the art. These building blocks are ligated and annealed to form 
gene segments which are then enzymatically assembled to construct the 
5 entire gene. "Chemically synthesized", as related to a sequence of DNA, 
means that the component nucleotides were assembled in vitro. Manual 
chemical synthesis of DNA may be accomplished using well established 
procedures, or automated chemical synthesis can be performed using one 
of a number of commercially available machines. Accordingly, the genes 

10 can be tailored for optimal gene expression based on optimization of 
nucleotide sequence to reflect the codon bias of the host cell. The skilled 
artisan appreciates the likelihood of successful gene expression if codon 
usage is biased towards those codons favored by the host. Determination 
of preferred codons can be based on a survey of genes derived from the 

15 host cell where sequence information is available. 

"Gene" refers to a nucleic acid fragment that expresses a specific 
protein, including regulatory sequences preceding (5* non-coding 
sequences) and following (3' non-coding sequences) the coding 
sequence. "Native gene" refers to a gene as found in nature with its own 

20 regulatory sequences. "Chimeric gene" refers to any gene that is not a 
native gene, comprising regulatory and coding sequences that are not 
found together in nature. Accordingly, a chimeric gene may comprise 
regulatory sequences and coding sequences that are derived from 
different sources, or regulatory sequences and coding sequences derived 

25 from the same source, but arranged in a manner different than that found 
in nature. "Endogenous gene" refers to a native gene in its natural 
location in the genome of an organism. A "foreign" gene refers to a gene 
not normally found in the host organism, but that is introduced into the 
host organism by gene transfer. Foreign genes can comprise native 

30 genes inserted into a non-native organism, or chimeric genes. A 
"transgene" is a gene that has been introduced into the genome by a 
transformation procedure. 

"Coding sequence" refers to a DNA sequence that codes for a 
specific amino acid sequence. "Suitable regulatory sequences" refer to 

35 nucleotide sequences located upstream (5' non-coding sequences), 
within, or downstream (3' non-coding sequences) of a coding sequence, 
and which influence the transcription, RNA processing or stability, or 
translation of the associated coding sequence. Regulatory sequences 
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may include promoters, translation leader sequences, introns, 
polyadenylation recognition sequences, RNA processing site, effector 
binding site and stem-loop structures. 

"Promoter" refers to a DNA sequence capable of controlling the 

5 expression of a coding sequence or functional RNA. In general, a coding 
sequence is located 3' to a promoter sequence. Promoters may be 
derived in their entirety from a native gene, or be composed of different 
elements derived from different promoters found in nature, or even 
comprise synthetic DNA segments. It is understood by those skilled in the 

10 art that different promoters may direct the expression of a gene in different 
tissues or cell types, or at different stages of development, or in response 
to different environmental or physiological conditions. Promoters which 
cause a gene to be expressed in most cell types at most times are 
commonly referred to as "constitutive promoters". It is further recognized 

15 that since in most cases the exact boundaries of regulatory sequences 
have not been completely defined, DNA fragments of different lengths 
may have identical promoter activity. 

The u 3' non-coding sequences" refer to DNA sequences located 
downstream of a coding sequence and include polyadenylation 

20 recognition sequences and other sequences encoding regulatory signals 
capable of affecting mRNA processing or gene expression. The 
polyadenylation signal is usually characterized by affecting the addition of 
polyadenylic acid tracts to the 3' end of the mRNA precursor. 
"RNA transcript" refers to the product resulting from RNA 

25 polymerase-catalyzed transcription of a DNA sequence. When the RNA 
transcript is a perfect complementary copy of the DNA sequence, it is. 
referred to as the primary transcript or it may be a RNA sequence derived 
from post-transcriptional processing of the primary transcript and is 
referred to as the mature RNA. "Messenger RNA (mRNA)" refers to the 

30 RNA that is without introns and that can be translated into protein by the 
cell. "cDNA" refers to a double-stranded DNA that.is complementary to 
and derived from mRNA. "Sense" RNA refers to RNA transcript that 
includes the mRNA and so can be translated into protein by the cell. 
"Antisense RNA" refers to a RNA transcript that is complementary to all or 

35 part of a target primary transcript or mRNA and that blocks the expression 
of a target gene (U.S. Patent No. 5,107,065; WO 9928508). The 
complementarity of an antisense RNA may be with any part of the specific 
gene transcript, i.e., at the 5' non-coding sequence, 3' non-coding 
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sequence, or the coding sequence. "Functional RNA" refers to antisense 
RNA, ribozyme RNA, or other RNA that is not translated yet has an effect 
on cellular processes. 

The term "operably linked" refers to the association of nucleic acid 

5 sequences on a single nucleic acid fragment so that the function of one is 
affected by the other. For example, a promoter is operably linked with a 
coding sequence when it is capable of affecting the expression of that 
coding sequence (i.e., that the coding sequence is under the 
transcriptional control of the promoter). Coding sequences can be 

10 operably linked to regulatory sequences in sense or antisense orientation. 
The term "expression", as used herein, refers to the transcription 
and stable accumulation of sense (mRNA) or antisense RNA derived from 
the nucleic acid fragment of the invention. Expression may also refer to 
translation of mRNA into a polypeptide. 

15 'Transformation" refers to the transfer of a nucleic acid fragment 

into the genome of a host organism, resulting in genetically stable 
inheritance. Host organisms containing the transformed nucleic acid 
fragments are referred to as "transgenic" or "recombinant" or 
"transformed" organisms. . 

20 The terms "plasmid", "vector and "cassette" refer to an extra 

chromosomal element often carrying genes which are not part of the 
central metabolism of the cell, and usually in the form of circular double- 
stranded DNA molecules. Such elements may be autonomously 
replicating sequences, genome integrating sequences, phage or 

25 nucleotide sequences, linear or circular, of a single- or double-stranded 
DNA or RNA, derived from any source, in which a number of nucleotide 
sequences have been joined or recombined into a unique construction 
which is capable of introducing a promoter fragment and DNA sequence 
for a selected gene product along with appropriate 3' untranslated 

30 sequence into a cell. Transformation cassette" refers to a specific vector 
containing a foreign gene and having elements in addition to the foreign 
gene that facilitate transformation of a particular host cell. "Expression 
cassette" refers to a specific vector containing a foreign gene and having 
elements in addition to the foreign gene that allow for enhanced 

35 expression of that gene in a foreign host. 

The term "sequence analysis software" refers to any computer 
algorithm or software program that is useful for the analysis of nucleotide 
or amino acid sequences. "Sequence analysis software" may be 
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commercially available or independently developed. Typical sequence 
analysis software will include but is not limited to the GCG suite of 
programs (Wisconsin Package Version 9.0, Genetics Computer Group 
(GCG), Madison, Wl), BLASTP, BLASTN, BLASTX (Altschul era/., J. Mol. 
5 Biol. 21 5:403-41 0 (1 990), and DNASTAR (DNASTAR, Inc. 1 228 S. Park 
St. Madison, Wl 53715 USA), and the FASTA program incorporating the 
Smith-Waterman algorithm (W. R. Pearson, Comput. Methods Genome 
Res., [Proc. Int. Symp.] (1994), Meeting Date 1992, 111-20. Editor(s): 
Suhai, Sandor. Publisher Plenum, New York, NY). Within the context of 

10 this application it will be understood that where sequence analysis 

software is used for analysis, that the results of the analysis will be based 
on the "default values" of the program referenced, unless otherwise 
specified. As used herein "default values" will mean any set of values or 
parameters which originally load with the software when first initialized. 

15 The term "signature sequence" means a set of amino acids 

conserved at specific positions along an aligned sequence of 
evolutionarily related proteins. While amino acids at other positions can 
vary between homologous proteins, amino acids which are highly 
conserved at specific positions indicate amino acids which are essential in 

20 the structure/the stability, or the activity of a protein. Because they are 
identified by their high degree of conservation in aligned sequences of a 
family of protein homologues, they can be. used as identifiers, or 
"signatures", to determine if a protein with a newly determined sequence 
belongs to a previously identified protein family. Signature sequences of 

25 the present invention are specifically described Figure 6 showing the 
signature sequence comprised of p1-p74 of SEQ ID NO:47, p1-p76 of 
SEQ ID NO:48 and p1-p41 of SEQ ID NO:49. 

Standard recombinant DNA and molecular cloning techniques used 
here are well known in the art and are described by Sambrook, J., Fritsch, 

30 E. F. and Maniatis, T., Molecular Cloning: A Laboratory Manual . Second 
Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY 
(1989) (hereinafter "Maniatis"); and by Silhavy, T. J., Bennan, M. L. and 
Enquist, L. W., Experiments with Gene Fusions . Cold Spring Harbor 
Laboratory Cold Press Spring Harbor, NY (1984); and by Ausubel, F. M. et 

35 a/., Current Protocols in Molecular Biology , published by Greene 
Publishing Assoc. and Wiley-lnterscience (1987). 
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Isolation Of Mcroorganjsm s Having Baftver-Villiaer Monooxygenase 
Activity 

Microorganisms having Baeyer-Villiger monooxygenase activity 
may be isolated from a variety of sources. Suitable sources include 
industrial waste streams, soil from contaminated industrial sites and waste 
stream treatment facilities. The Baeyer-Villiger monooxygenase 
containing microorganisms of the instant invention were isolated from 
activated sludge from waste water treatment plants. 

Samples suspected of containing a microorganism having Baeyer- 
Villiger monooxygenase activity may be enriched by incubation in a 
suitable growth medium in combination with at least one ketone substrate. 
Suitable ketone substrates for use in the instant invention include cyclic 
ketones and ketoterpenes having the general formula: 



wherein R and R 1 are independently selected from substituted or 
unsubstituted phenyl, substituted or unsubstituted alkyl, or substituted or 
unsubstituted alkenyl or substituted or unsubstituted alkylidene. These 
20 compounds may be synthetic or natural secondary metabolites 

Particularly useful ketone substrates include, but are not limited to 
Norcamphor, Cyclobutanone, Cyclopentanone, 2-methyl-cyclopentanone, 
Cyclohexanone, 2-methyl-cyclohexanone, Cyclohex-2-ene-1-one, 1,2- 
cyclohexanedione, 1 ,3-cyclohexanedione, 1 ,4-cyclohexanedione,' 

25 Cycloheptanone, Cyclooctanone, Cyclodecanone, Cycloundecanone, 
Cyclododecanone, Cyclotridecanone, Cyclopenta-decanone, 2- 
tridecanone, dihexyl ketone, 2-phenyl-cyclohexanone, Oxindole, 
Levoglucosenone, dimethyl sulfoxide, dimethy-2-piperidone, 
Phenylboronic acid, and beta-ionone.Growth medium and techniques 

30 needed in the enrichment and screening of microorganisms are well 
known in the art and examples may be found in Manual of Methods for 
General Bacteriology (Phillipp Gerhardt, R. G. E. Murray, Ralph N. 
Costilow, Eugene W. Nester, Willis A. Wood, Noel R. Krieg and G. Briggs 
Phillips, eds), American Society for Microbiology, Washington, DC. 

35 (1994)); or by Thomas D. Brock in Biotechnology: A Textbook nf Industrial 
Microbiology , Second Edition, Sinauer Associates. Inc., Sunderland MA 
(1989). 
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Characterization of the Baever-Villiaer Monooxvaenase C ontaining 
Microorganisms: 

The sequence of the small subunit ribosomal RNA or DNA (1 6S 
rDNA) is frequently used for taxonomic identification of novel bacterial. 

5 Currently, more than 7,000 bacterial 16S rDNA sequences are now 

available. Highly conserved regions of the 16S rDNA provide priming sites 
for broad-range polymerase chain reaction (PCR) (or RT-PCR) and 
obviate the need for specific information about a targeted microorganism 
before this procedure. This permits identification of a previously 

10 uncharacterized bacterium by broad range bacterial 16S rDNA 
amplification, sequencing, and phylogenetic analysis. 

This invention describes the isolation and identification of 7 
different bacteria based on their taxonomic identification following 
amplification of the 16S rDNA using primers corresponding to conserved 

15 regions of the 16S rDNA molecule (Amann, R.I. et al. Microbiol. Rev. 
59(1): 143-69 (1995); Kane, M.D. et al. Appl. Environ. Microbiol. 59:682- 
686 (1993)), followed by sequencing and BLAST analysis (Basic Local 
Alignment Search Tool; Altschul, S. F., ef al., J. Mol. Biol. 215:403-410 
(1993); see also www.ncbi.nlm.nih.gov/BLAST/). Bacterial strains were 

20 identified as highly homologous to bacteria of the genera Brevibactehum, 
Arthrobacter, Acinetobacter, Acidovorax, and Rhodococcus. 

Comparison of the 16S rRNA nucleotide base sequence from strain 
AN 12 to public databases reveals that the most similar known sequences 
(98% homologous) are the 16S rRNA gene sequences of bacteria 

25 belonging to the genus Rhodococcus. 

Comparison of the 16S rRNA nucleotide base sequence from strain 
CHX to public databases reveals that the most similar known sequences 
(97% homologous) are the 16S rRNA gene sequences of bacteria of the 
genus Acidovorax. 

30 Comparison of the 1 6S rRNA nucleotide base sequence from strain 

BP2 to public databases reveals that the most similar known sequences 
(99% homologous) are the 1 6S rRNA gene sequences of bacteria of the 
genus Arthrobacter. Comparison of the 16S rRNA nucleotide base 
sequence from strain SE19 to public databases reveals that the most 

35 similar known sequences (99% homologous) are the 1 6S rRNA gene 
sequences of bacteria of the genus Acinetobacter. 

Comparison of the 16S rRNA nucleotide base sequence from 
strains phil and phi2 to public databases reveals that the most similar 
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known sequences (99% homologous) are the 16S rRNA gene sequences 
of bacteria belonging to the genus Rhodococcus. 
Identification of Baever-Villiqer Monooxvaenase Homologs 

The present invention provides examples of Baeyer-Villiger 
5 monooxygenase genes and gene products having the ability to convert 
suitable ketone substrates comprising cyclic ketones and ketoterpenes to 
the corresponding lactone or ester. For example, genes encoding 
BVMO's have been isolated from Arthrobacter (SEQ ID NO:11), 
Brevibacterium (SEQ ID NOs:13 and 15), Acidovorax (SEQ ID NO:17), 

10 Acinetobacter (SEQ ID NO:1 9) , and Rhodococcus (SEQ ID NOs:7, 9, 21 , 
23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, and 45). 

Comparison of the /Irf/irabacfer sp. BP2 chnB nucleotide base and 
deduced amino acid sequences to public databases reveals that the most 
similar known sequences range from a distant as about 57% identical to 

15 the amino acid sequence of reported herein over length of 532 amino 
acids using a Smith-Waterman alignment algorithm (W. R. Pearson, 
supra). Preferred amino acid fragments are at least about 70% - 80% and 
more preferred amino acid fragments are at least about 80%-90% 
identical to the sequences herein. Most preferred are nucleic acid 

20 fragments that are at least 95% identical to the amino acid fragments 
reported herein. Similarly, preferred chnB encoding nucleic acid 
sequences corresponding to the instant ORFs are those encoding active 
proteins and which are at least 80% identical to the nucleic acid 
sequences reported herein. More preferred chnB nucleic acid fragments 

25 are at least 90% identical to the sequences herein. Most preferred are 
chnB nucleic acid fragments that are at least 95% identical to the nucleic 
acid fragments reported herein. 

Comparison of the Acidovorax sp. CHX chnB nucleotide base and 
deduced amino acid sequences to public databases reveals that the most 

30 similar known sequences range from a distant as about 57% identical to 
the amino acid sequence of reported herein over length of 538 amino 
acids using a Smith-Waterman alignment algorithm (W. R. Pearson, 
supra). Preferred amino acid fragments are at least about 70% - 80% and 
more preferred amino acid fragments are at least about 80%-90% 

35 identical to the sequences herein. Most preferred are nucleic acid 
fragments that are at least 95% identical to the amino acid fragments 
reported herein. Similarly, preferred chnB encoding nucleic acid 
sequences corresponding to the instant ORF's are those encoding active 
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proteins and which are at least 80% identical to the nucleic acid 
sequences reported herein. More preferred chnB nucleic acid fragments 
are at least 90% identical to the sequences herein. Most preferred are 
chnB nucleic acid fragments that are at least 95% identical to the nucleic 
5 acid fragments reported herein. 

Comparison of the Rhodococcus sp. phil chnB nucleotide base 
and deduced amino acid sequences to public databases reveals that the 
most similar known sequences range from a distant as about 55% 
identical to the amino acid sequence of reported herein over length of 542 

10 amino acids using a Smith-Waterman alignment algorithm (W. R. 

Pearson, supra). Preferred amino acid fragments are at least about 70% - 
80% and more preferred amino acid fragments are at least about 
80%-90% identical to the sequences herein. Most preferred are nucleic 
acid fragments that are at least 95% identical to the amino acid fragments 

15 reported herein. Similarly, preferred chnB encoding nucleic acid 

sequences corresponding to the instant ORF's are those encoding active 
proteins and which are at least 80% identical to the nucleic acid 
sequences reported herein. More preferred chnB nucleic acid fragments 
are at least 90% identical to the sequences herein. Most preferred are 

20 chnB nucleic acid fragments that are at least 95% identical to the nucleic 
acid fragments reported herein. 

Comparison of the Rhodococcus sp. .phi2 chnB nucleotide base 
and deduced amino acid sequences to public databases reveals that the 
most similar known sequences range from a distant as about 53% 

25 identical to the amino acid sequence of reported herein over length of 541 
amino acids using a Smith-Waterman alignment algorithm (W. R. 
Pearson, supra). Preferred amino acid fragments are at least about 70% - 
80% and more preferred amino acid fragments are at least about 
80%-90% identical to the sequences herein. Most preferred are nucleic 

30 acid fragments that are at least 95% identical to the amino acid fragments 
reported herein. Similarly, preferred chnB encoding nucleic acid 
sequences corresponding to the instant ORF's are those encoding active 
proteins and which are at least 80% identical to the nucleic acid 
sequences reported herein. More preferred chnB nucleic acid fragments 

35 are at least 90% identical to the sequences herein. Most preferred are 
chnB nucleic acid fragments that are at least 95% identical to the nucleic 
acid fragments reported herein. 

20 
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Comparison of the Rhodococcus erythropolis AN 12 ORF8 chnB 
nucleotide base and deduced amino acid sequences to public databases 
reveals that the most similar known sequences range from a distant as 
about 37% identical to the amino acid sequence of reported herein over 
5 length of 439 amino acids using a Smith-Waterman alignment algorithm 
(W. R. Pearson, supra). Preferred amino acid fragments are at least about 
70% - 80% and more preferred amino acid fragments are at least about 
80%-90% identical to the sequences herein. Most preferred are nucleic 
acid fragments that are at least 95% identical to the amino acid fragments 
10 reported herein. Similarly, preferred chnB encoding nucleic acid 

sequences corresponding to the instant ORF's are those encoding active 
proteins and which are at least 80% identical to the nucleic acid 
sequences reported herein. More preferred chnB nucleic acid fragments 
are at least 90% identical to the sequences herein. Most preferred are 

1 5 chnB nucleic acid fragments that are at least 95% identical to the nucleic 
acid fragments reported herein. 

Comparison of the Rhodococcus erythropolis AN1 ORF9 chnB 
nucleotide base and deduced amino acid sequences to public databases 
reveals that the most similar known sequences range from a distant as 

20 about 44% identical to the amino acid sequence of reported herein over 
length of 518 amino acids using a Smith-Waterman alignment algorithm 
(W. R. Pearson, supra). Preferred amino acid fragments are at least about 
70% - 80% and more preferred amino acid fragments are at least about 
80%-90% identical to the sequences herein. Most preferred are nucleic 

25 acid fragments that are at least 95% identical to the amino acid fragments 
reported herein. Similarly, preferred chnB encoding nucleic acid 
sequences corresponding to the instant ORF's are those encoding active 
proteins and which are at least 80% identical to the nucleic acid 
sequences reported herein. More preferred chnB nucleic acid fragments 

30 are at least 90% identical to the sequences herein. Most preferred are 
chnB nucleic acid fragments that are at least 95% identical to the nucleic 
acid fragments reported herein. 

Comparison of the Rhodococcus erythropolis AN1 ORF10 chnB 
nucleotide base and deduced amino acid sequences to public databases 

35 reveals that the most similar known sequences range from a distant as 
about 64% identical to the amino acid sequence of reported herein over 
length of 541 amino acids using a Smith-Waterman alignment algorithm. 
(W. R. Pearson, supra). Preferred amino acid fragments are at least about 



21 



WO 03/020890 



PCT/US02/27549 



70% - 80% and more preferred amino acid fragments are at least about 
80%-90% identical to the sequences herein. Most preferred are nucleic 
acid fragments that are at least 95% identical to the amino acid fragments 
reported herein. Similarly, preferred chnB encoding nucleic acid 
5 sequences corresponding to the instant ORF's are those encoding active 
proteins and which are at least 80% identical to the nucleic acid 
sequences reported herein. More preferred chnB nucleic acid fragments 
are at least 90% identical to the sequences herein. Most preferred are 
chnB nucleic acid fragments that are at least 95% identical to the nucleic 

10 acid fragments reported herein. 

Comparison of the Rhodococcus erythropolis AN1 ORF11 chnB 
nucleotide base and deduced amino acid sequences to public databases 
reveals that the most similar known sequences range from a distant as 
about 65% identical to the amino acid sequence of reported herein over 

15 length of 462 amino acids using a Smith-Waterman alignment algorithm 
(W. R. Pearson, supra). Preferred amino acid fragments are at least about 
70% - 80% and more preferred amino acid fragments are at least about 
80%-90% identical to the sequences herein. Most preferred are nucleic 
acid fragments that are at least 95% identical to the amino acid fragments 

20 reported herein. Similarly, preferred chnB encoding nucleic acid 

sequences corresponding to the instant ORF's are those encoding active 
proteins and which are at least 80% identical to the nucleic acid 
sequences reported herein. More preferred chnB nucleic acid fragments 
are at least 90% identical to the sequences herein. Most preferred are 

25 chnB nucleic acid fragments that are at least 95% identical to the nucleic 
acid fragments reported herein. 

Comparison of the Rhodococcus erythropolis AN1 ORF12 chnB 
nucleotide base and deduced amino acid sequences to public databases 
reveals that the most similar known sequences range from a distant as 

30 about 45% identical to the amino acid sequence of reported herein over 
length of 523 amino acids using a Smith-Waterman alignment algorithm 
(W. R. Pearson, supra). Preferred amino acid fragments are at least about 
70% - 80% and more preferred amino acid fragments are at least about 
80%-90% identical to the sequences herein. Most preferred are nucleic 

35 acid fragments that are at least 95% identical to the amino acid fragments 
reported herein. Similarly, preferred chnB encoding nucleic acid 
sequences corresponding to the instant ORF's are those encoding active 
proteins and which are at least 80% identical to the nucleic acid 
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sequences reported herein. More preferred chnB nucleic acid fragments 
are at least 90% identical to the sequences herein. Most preferred are 
chnB nucleic acid fragments that are at least 95% identical to the nucleic 
acid fragments reported herein. 

5 Comparison of the Rhodococcus erythropolis AN1 ORF13 chnB 

nucleotide base and deduced amino acid sequences to public databases 
reveals that the most similar known sequences range from a distant as 
about 55% identical to the amino acid sequence of reported herein over 
length of 493 amino acids using a Smith-Waterman alignment algorithm 

10 (W. R. Pearson, supra). Preferred amino acid fragments are at least about 
70% - 80% and more preferred amino acid fragments are at least about 
80%-90% identical to the sequences herein. Most preferred are nucleic 
acid fragments that are at least 95% identical to the amino acid fragments 
reported herein. Similarly, preferred chnB encoding nucleic acid 

15 sequences corresponding to the instant ORFs are those encoding active 
proteins and which are at least 80% identical to the nucleic acid 
sequences reported herein. More preferred chnB nucleic acid fragments 
are at least 90% identical to the sequences herein. Most preferred are 
chnB nucleic acid fragments that are at least 95% identical to the nucleic 

20 acid fragments reported herein. 

Comparison of the Rhodococcus erythropolis AN1 ORF14 chnB 
nucleotide base and deduced amino acid sequences to public databases 
reveals that the most similar known sequences range from a distant as 
about 51% identical to the amino acid sequence of reported herein over 

25 length of 539 amino acids using a Smith-Waterman alignment algorithm 
(W. R. Pearson, supra). Preferred amino acid fragments are at least about 
70% - 80% and more preferred amino acid fragments are at least about 
80%-90% identical to the sequences herein. Most preferred are nucleic 
acid fragments that are at least 95% identical to the amino acid fragments 

30 reported herein. Similariy, preferred chnB encoding nucleic acid 

sequences corresponding to the instant ORF's are those encoding active 
proteins and which are at least 80% identical to the nucleic acid 
sequences reported herein. More preferred chnB nucleic acid fragments 
are at least 90% identical to the sequences herein. Most preferred are 

35 chnB nucleic acid fragments that are at least 95% identical to the nucleic 
acid fragments reported herein. 

Comparison of the Rhodococcus erythropolis AN1 ORF15 chnB 
nucleotide base and deduced amino acid sequences to public databases 
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reveals that the most similar known sequences range from a distant as 
about 39% identical to the amino acid sequence of reported herein over 
length of 649 amino acids using a Smith-Waterman alignment algorithm 
(W. R. Pearson, supra). Preferred amino acid fragments are at least about 

5 70% - 80% and more preferred amino acid fragments are at least about 
80%-90% identical to the sequences herein. Most preferred are nucleic 
acid fragments that are at least 95% identical to the amino acid fragments 
reported herein. Similarly, preferred chnB encoding nucleic acid 
sequences corresponding to the instant ORF's are those encoding active 

10 proteins and which are at least 80% identical to the nucleic acid 

sequences reported herein. More preferred chnB nucleic acid fragments 
are at least 90% identical to the sequences herein. Most preferred are 
chnB nucleic acid fragments that are at least 95% identical to the nucleic 
acid fragments reported herein. 

15 Comparison of the Rhodococcus erythropolis AN 1 ORF16 chnB 

nucleotide base and deduced amino acid sequences to public databases 
reveals that the most similar known sequences range from a distant as 
about 43% identical to the amino acid sequence of reported herein over 
length of 494 amino acids using a Smith-Waterman alignment algorithm 

20 (W. R. Pearson, supra). Preferred amino acid fragments are at least about 
70% - 80% and more preferred amino acid fragments are at least about 
80%-90% identical to the sequences herein. Most preferred are nucleic 
acid fragments that are at least 95% identical to the amino acid fragments 
reported herein. Similarly, preferred chnB encoding nucleic acid 

25 sequences corresponding to the instant ORF's are those encoding active 
proteins and which are at least 80% identical to the nucleic acid 
sequences reported herein. More preferred chnB nucleic acid fragments 
are at least 90% identical to the sequences herein. Most preferred are 
chnB nucleic acid fragments that are at least 95% identical to the nucleic 

30 acid fragments reported herein. 

Comparison of the Rhodococcus erythropolis AN1 ORF17 chnB 
nucleotide base and deduced amino acid sequences to public databases 
reveals that the most similar known sequences range from a distant as 
about 53% identical to the amino acid sequence of reported herein over 

35 length of 499 amino acids using a Smith-Waterman alignment algorithm 
(W. R. Pearson, supra). Preferred amino acid fragments are at least about 
70% - 80% and more preferred amino acid fragments are at least about 
80%-90% identical to the sequences herein. Most preferred are nucleic 
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acid fragments that are at least 95% identical to the amino acid fragments 
reported herein. Similarly, preferred chnB encoding nucleic acid 
sequences corresponding to the instant ORF's are those encoding active 
proteins and which are at least 80% identical to the nucleic acid 
5 sequences reported herein. More preferred chnB nucleic acid fragments 
are at least 90% identical to the sequences herein. Most preferred are 
chnB nucleic acid fragments that are at least 95% identical to the nucleic 
acid fragments reported herein. 

Comparison of the Rhodococcus erythropolis AN1 ORF18 chnB 

10 nucleotide base and deduced amino acid sequences to public databases 
reveals that the most similar known sequences range from a distant as 
about 44% identical to the amino acid sequence of reported herein over 
length of 493 amino acids using a Smith-Waterman alignment algorithm 
(W. R. Pearson, supra). Preferred amino acid fragments are at least about 

15 70% - 80% and more preferred amino acid fragments are at least about 
80%-90% identical to the sequences herein. Most preferred are nucleic 
acid fragments that are at least 95% identical to the amino acid fragments 
reported herein. Similarly, preferred cnnfi encoding nucleic acid 
sequences corresponding to the instant ORF's are those encoding active 

20 proteins and which are at least 80% identical to the nucleic acid 

sequences reported herein. More preferred chnB nucleic acid fragments 
are at least 90% identical to the sequences herein. Most preferred are 
chnB nucleic acid fragments that are at least 95% identical to the nucleic 
acid fragments reported herein. 

25 Comparison of the Rhodococcus erythropolis AN1 ORF19 chnB 

nucleotide base and deduced amino acid sequences to public databases 
reveals that the most similar known sequences range from a distant as 
about 54% identical to the amino acid sequence of reported herein over 
length of 541 amino acids using a Smith-Waterman alignment algorithm 

30 (W. R. Pearson, supra). Preferred amino acid fragments are at least about 
70% - 80% and more preferred amino acid fragments are at least about 
80%-90% identical to the sequences herein. Most preferred are nucleic 
acid fragments that are at least 95% identical to the amino acid fragments 
reported herein. Similarly, preferred chnB encoding nucleic acid 

35 sequences corresponding to the instant ORF's are those encoding active 
proteins and which are at least 80% identical to the nucleic acid 
sequences reported herein. More preferred chnB nucleic acid fragments 
are at least 90% identical to the sequences herein. Most preferred are 
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chnB nucleic acid fragments that are at least 95% identical to the nucleic 
acid fragments reported herein. 

Comparison of the Rhodococcus erythropolis AN1 ORF20 chnB 
nucleotide base and deduced amino acid sequences to public databases 

5 reveals that the most similar known sequences range from a distant as 
about 42% identical to the amino acid sequence of reported herein over 
length of 545 amino acids using a Smith-Waterman alignment algorithm 
(W. R. Pearson, supra). Preferred amino acid fragments are at least about 
70% - 80% and more preferred amino acid fragments are at least about 

10 80%-90% identical to the sequences herein. Most preferred are nucleic 
acid fragments that are at least 95% identical to the amino acid fragments 
reported herein. Similarly, preferred chnB encoding nucleic acid 
sequences corresponding to the instant ORFs are those encoding active 
proteins and which are at least 80% identical to the nucleic acid 

15 sequences reported herein. More preferred chnB nucleic acid fragments 
are at least 90% identical to the sequences herein. Most preferred are 
chnB nucleic acid fragments that are at least 95% identical to the nucleic 
acid fragments reported herein. 

In addition to the identification of the above mentioned sequences 

20 and the biochemical characterization of the activity of the gene product, 
Applicants have made the discovery that many of these monooxygenase 
proteins share diagnostic signature sequences which may be used for the 
identification of other proteins having similar activity. For example, the 
present monooxygenases may be grouped into three general families 

25 based on sequence alignment. One group, referred to herein BV Family 1 , 
is comprised of the monooxygenase sequences shown in Figure 7 and 
generating the consensus sequence as set forth in SEQ ID NO:47. As will 
be seen in Figure 7, there are a group of completely conserved amino 
acids in 74 positions across all of the sequences of Figure 7. These 

30 positions are further delineated in Figure 6, and indicated as p1 - p74. 
Similarly, BV Family 2 is comprised of the monooxygenase 
sequences shown on Figure 8, and generating the consensus sequence 
as set forth in SEQ ID NO:48. The signature seqeunce of BV Family 2 
monooxygenases is shown in Figure 6 having the positions p1-p76. 

35 BV Family 3 monooxygenases are shown in Figure 9, generating the 

consensus sequence as set for the in SEQ ID NO:49, having the signature 
sequence as shown in Figure 6 of positions p1-p41 . 
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Although there is variation among the sequences of the various 
families, all of the individual members of these families have been shown 
to possess monooxygenase activity. Thus, it is contemplated that where a 
polypeptide possesses the signature sequences as defined in Figures 6-9 
5 that it will have monooxygenase activity. It is thus within the scope of the 
present invention to provide a method for identifying a gene encoding a 
Baeyer-Villiger monooxygenase polypeptide comprising: 

(a) probing a genomic library with a nucleic acid fragment 
encoding a polypeptide wherein where at least 80% of the 

10 amino acid residues at positions p1 - p74 of SEQ ID NO:47, 

or at least 80% of the amino acid residues at p1-p76 of 
SEQ ID NO:48 or at least 80% of the amino acid residues 
of p1-p41 of SEQ ID NO:49 are completely conserved; 

(b) identifying a DNA clone that hybridizes with a nucleic acid 
15 fragment of step (a); 

(c) sequencing the genomic fragment that comprises the 
clone identified in step (b), 

wherein the sequenced genomic fragment encodes a Baeyer- 
Villiger monooxygenase polypeptide. 

20 In a preferred embodiment the invention provides the above 

method wherein where at least 100% of the amino acid residues at 
positions p1- p74 of SEQ ID NO:47, or at least 100% of the amino acid 
residues at p1-p76 of SEQ ID NO:48 or at least 100% of the amino acid 
residues of p1-p41 of SEQ ID NO:49 are completely conserved. 

25 It will be appreciated that other Baeyer-Villiger monooxygenase 

genes having similar substrate specificity may be identified and isolated 
on the basis of sequence dependent protocols or according to alignment 
against the signature sequences disclosed herein. 

Isolation of homologous genes using sequence-dependent 

30 protocols is well known in the art. Examples of sequence-dependent 
protocols include, but are not limited to, methods of nucleic acid 
hybridization, and methods of DNA and RNA amplification as exemplified 
by various uses of nucleic acid amplification technologies (e.g polymerase 
chain reaction (PCR). Mullis etal., U.S. Patent 4,683,202), ligase chain 

35 reaction (LCR), Tabor, S. et a/., Proc. Acad. Sci. USA 82: 1 074, (1 985)) or 
strand displacement amplification (SDA, Walker, etal., Proc. Natl. Acad. 
Sci. U.S.A., 89: 392, (1992)). 
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For example, genes encoding similar proteins or polypeptides to 
the present Baeyer-Villiger monooxygenases could be isolated directly by 
using all or a portion of the nucleic acid fragments set forth in SEQ ID 
NOs:7, 9. 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31 , 33, 35, 37, 39, 41 , 43, 

5 and 45 or as DNA hybridization probes to screen libraries from any 

desired bacteria using methodology well known to those skilled in the art. 
Specific oligonucleotide probes based upon the instant nucleic acid 
sequences can be designed and synthesized by methods known in the art 
(Maniatis, supra). Moreover, the entire sequences can be used directly to 

10 synthesize DNA probes by methods known to the skilled artisan such as 
random primers DNA labeling, nick translation, or end-labeling techniques, 
or RNA probes using available in vitro transcription systems. In addition, 
specific primers can be designed and used to amplify a part of or full- 
length of the instant sequences. The resulting amplification products can 

15 be labeled directly during amplification reactions or labeled after 

amplification reactions, and used as probes to isolate full length DNA 
fragments under conditions of appropriate stringency. 

Typically, in PCR-type primer directed amplification techniques, the 
primers have different sequences and are not complementary to each 

20 other. Depending on the desired test conditions, the sequences of the 
primers should be designed to provide for both efficient and faithful 
replication of the target nucleic acid. Methods of PCR primer design are 
common and well known in the art. (Thein and Wallace, The use of 
oligonucleotide as specific hybridization probes in the Diagnosis of 

25 Genetic Disorders", in Human Genetic Diseases: A Practical Approach, K. 
E. Davis Ed., (1986) pp. 33-50 IRL Press, Herndon, Virginia; Rychlik, W. 
(1993) In White, B. A. (ed.), Methods in Molecular Biology . Vol. 15, 
pages 31-39, PCR Protocols: Current Methods and Applications. 
Humania Press, Inc., Totowa, NJ.) 

30 Generally PCR primers may be used to amplify longer nucleic acid 

fragments encoding homologous genes from DNA or RNA. However, the 
polymerase chain reaction may also be performed on a library of cloned 
nucleic acid fragments wherein the sequence of one primer is derived 
from the instant nucleic acid fragments, and the sequence of the other 

35 primer takes advantage of the presence of the polyadenylic acid tracts to 
the 3' end of the mRNA precursor encoding microbial genes. 
Alternatively, the second primer sequence may be based upon sequences 
derived from the cloning vector. For example, the skilled artisan can 



28 



WO 03/020890 



PCT/US02/27549 



follow the RACE protocol (Frohman et a/., PNAS USA 85:8998 (1988)) to 
generate cDNAs by using PCR to amplify copies of the region between a 
single point in the transcript and the 3" or 5' end. Primers oriented in the 3' 
and 5' directions can be designed from the instant sequences. Using 

5 commercially available 3' RACE or 5' RACE systems (BRL), specific 3' or 
5" cDNA fragments can be isolated (Ohara et a/., PNAS USA 86:5673 
(1989); Loh et a/., Science 243:217 (1989)). 

Accordingly the invention provides a method for identifying a 
nucleic acid molecule encoding a Baeyer-Villiger monooxygenase 

10 comprising: (a) synthesizing at least one oligonucleotide primer 
corresponding to a portion of the sequence selected from the group 
consisting of SEQ ID NOs:7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 
33, 35, 37, 39, 41 , 43, and 45 and (b) amplifying an insert present in a 
cloning vector using the oligonucleotide primer of step (a); wherein the 

15 amplified insert encodes a Baeyer-Villiger monooxygenase 

Alternatively the instant sequences may be employed as 
hybridization reagents for the identification of homologs. The basic 
components of a nucleic acid hybridization test include a probe, a sample 
suspected of containing the gene or gene fragment of interest, and a 

20 specific hybridization method. Probes of the present invention are 

typically single stranded nucleic acid sequences which are complementary 
to the nucleic acid sequences to be detected. Probes are "hybridizable" to 
the nucleic acid sequence to be detected. The probe length can vary from 
5 bases to tens of thousands of bases, and will depend upon the specific 

25 test to be done. Typically a probe length of about 1 5 bases to about 
30 bases is suitable. Only part of the probe molecule need be 
complementary to the nucleic acid sequence to be detected. In addition, 
the complementarity between the probe and the target sequence need not 
be perfect. Hybridization does occur between imperfectly complementary 

30 molecules with the result that a certain fraction of the bases in the 
hybridized region are not paired with the proper complementary base. 

Hybridization methods are well defined. Typically the probe and 
sample must be mixed under conditions which will permit nucleic acid 
hybridization. This involves contacting the probe and sample in the 

35 presence of an inorganic or organic salt under the proper concentration 
and temperature conditions. The probe and sample nucleic acids must be 
in contact for a long enough time that any possible hybridization between 
the probe and sample nucleic acid may occur. The concentration of probe 
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or target in the mixture will determine the time necessary for hybridization 
to occur. The higher the probe or target concentration the shorter the 
hybridization incubation time needed. Optionally a chaotropic agent may 
be added. The chaotropic agent stabilizes nucleic acids by inhibiting 

5 nuclease activity. Furthermore, the chaotropic agent allows sensitive and 
stringent hybridization of short oligonucleotide probes at room temperature 
[Van Ness and Chen (1991) Nucl. Acids Res. 19:5143-5151]. Suitable 
chaotropic agents include guanidinium chloride, guanidinium thiocyanate, 
sodium thiocyanate, lithium tetrachloroacetate, sodium perchlorate, 

10 rubidium tetrachloroacetate, potassium iodide, and cesium 

trifluoroacetate, among others. Typically, the chaotropic agent will be 
present at a final concentration of about 3M. If desired, one can add 
formamide to the hybridization mixture, typically 30-50% (v/v). 

Various hybridization solutions can be employed. Typically, these 

15 comprise from about 20 to 60% volume, preferably 30%, of a polar 

organic solvent. A common hybridization solution employs about 30-50% 
v/v formamide, about 0.15 to 1M sodium chloride, about 0.05 to 0.1 M 
buffers, such as sodium citrate, Tris-HCI, PIPES or HEPES (pH range 
about 6-9), about 0.05 to 0.2% detergent, such as sodium dodecylsulfate, 

20 or between 0.5-20 mM EDTA, FICOLL (Pharmacia Inc.) (about 

300-500 kilodaltons), polyvinylpyrrolidone (about 250-500 kdal), and 
serum albumin. Also included in the typical hybridization solution will be 
unlabeled carrier nucleic acids from about 0.1 to 5 mg/mL, fragmented 
nucleic DNA, e.g., calf thymus or salmon sperm DNA, or yeast RNA, and 

25 optionally from about 0.5 to 2% wt/vol glycine. Other additives may also 
be included, such as volume exclusion agents which include a variety of 
polar water-soluble or swellable agents, such as polyethylene glycol, 
anionic polymers such as polyacrylate or polymethylacrylate, and anionic 
saccharidic polymers, such as dextran sulfate. 

30 Thus, the invention provides a method for identifying a nucleic acid 

molecule encoding a Baeyer-Villiger monooxygenase comprising:(a) 
probing a genomic library with a portion of a nucleic acid molecule 
selected from the group consisting of SEQ ID NOs:7, 9, 1 1, 13, 15, 17, 19, 
21 , 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, and 45 ;(b) identifying a DNA 

35 clone that hybridizes under conditions of 0.1X SSC, 0.1% SDS, 65°C and 
washed with 2X SSC, 0.1% SDS followed by 0.1X SSC, 0.1 % SDS with 
the nucleic acid molecule of (a); and (c) sequencing the genomic fragment 
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that comprises the clone identified in step (b), wherein the sequenced 
genomic fragment encodes Baeyer-Villiger monooxygenase. 
Recombinant Expression-Microbial 

The genes and gene products of the present BVMO sequences 

5 may be introduced into microbial host cells. Preferred host cells for 
expression of the instant genes and nucleic acid molecules are microbial 
hosts that can be found broadly within the fungal or bacterial families and 
which grow over a wide range of temperature, pH values, and solvent 
tolerances. Because of transcription, translation and the protein 

10 biosynthetic apparatus is the same irrespective of the cellular feedstock, 
functional genes are expressed irrespective of carbon feedstock used to 
generate cellular biomass. Large scale microbial growth and functional 
gene expression may utilize a wide range of simple or complex 
carbohydrates, organic acids and alcohols, saturated hydrocarbons such 

15 as methane or carbon dioxide in the case of photosynthetic or 
chemoautotrophic hosts. However, the functional genes may be 
regulated, repressed or depressed by specific growth conditions, which 
may include the form and amount of nitrogen, phosphorous, sulfur, 
oxygen, carbon or any trace micronutrient including small inorganic ions. 

20 In addition, the regulation of functional genes may be achieved by the 
presence or absence of specific regulatory molecules that are added to 
the culture and are not typically considered nutrient or energy sources. 
Growth rate may also be an important regulatory factor in gene 
expression. Examples of suitable host strains include but are not limited 

25 to fungal or yeast species such as Aspergillus, Trichoderma, 

Saccharomyces, Pichia, Candida, Hansenula, or bacterial species such 
as member of the proteobacteria and actinomycetes as well as the 
specific genera Rhodococcus, Acinetobacter, Arthrobacter, Mycobacteria, 
Nocardia, Brevibacterium, Acidovorax, Bacillus, Streptomyces, 

30 Escherichia, Salmonella, Pseudomonas, Aspergillus, Saccharomyces, 
Pichia, Candida, Cornyebacterium, and Hansenula. 

Particularly suitable in the present invention as hosts for 
monooxygenase are the members of the Proteobacteria and 
Actinomycetes. The Proteobacteria form a physiologically diverse group of 

35 microorganisms and represent five subdivisions (a, p, y, e, 8) (Madigan et 
al., Brock Biology of Microorganisms , 8th edition, Prentice Hall, 
UpperSaddle River, NJ (1997)). All five subdivisions of the Proteobacteria 
contain microorganisms that use organic compounds as sources of 
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carbon and energy. Members of the Proteobacteria suitable in the present 
invention include, but are not limited to Burkholderia, Alcaligenes, 
Pseudomonas, Sphingomonas, Pandoraea, Delftia and Comamonas. 
Microbial expression systems and expression vectors containing 

5 regulatory sequences that direct high level expression of foreign proteins 
are well known to those skilled in the art. Any of these could be used to 
construct chimeric genes for production of the any of the gene products of 
the instant sequences. These chimeric genes could then be introduced 
into appropriate microorganisms via transformation to provide high level 

10 expression of the enzymes. 

Vectors or cassettes useful for the transformation of suitable host 
cells are well known in the art. Typically the vector or cassette contains 
sequences directing transcription and translation of the relevant gene, a 
selectable marker, and sequences allowing autonomous replication or 

15 chromosomal integration. Suitable vectors comprise a region 5' of the 
gene which harbors transcriptional initiation controls and a region 3* of the 
DNA fragment which controls transcriptional termination. It is most 
preferred when both control regions are derived from genes homologous 
to the transformed host cell, although it is to be understood that such 

20 control regions need not be derived from the genes native to the specific 
species chosen as a production host. 

Initiation control regions or promoters, which are useful to drive 
expression of the instant ORF's in the desired host cell are numerous and 
familiar to those skilled in the art. Virtually any promoter capable of driving 

25 these genes is suitable for the present invention including but not limited 
to CYC1, HIS3, GAL1, GAL10, ADH1, PGK, PH05, GAPDH, ADC1, 
TRP1, URA3, LEU2, ENO, TPI (useful for expression in Saccharomyces); 
AOX1 (useful for expression in Pichia); and lac, ara, tet, trp, IP b IP R , 17, 
tac, and trc (useful for expression in Escherichia coli) as well as the amy, 

30 apr, npr promoters and various phage promoters useful for expression in 
Sac/7/us. 

Termination control regions may also be derived from various 
genes native to the preferred hosts. Optionally, a termination site may be 
unnecessary, however, it is most preferred if included. 
35 Recombinant Expression-Plants 

The sequences encoding the BVMO's of the present invention may 
be used to create transgenic plants having the ability to express the 
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microbial proteins. Preferred plant hosts will be any variety that will 
support a high production level of the instant proteins. 

Suitable green plants will included but are not limited to of soybean, 
rapeseed (Brassica napus, B. campestris), sunflower (Helianthus annus), 

5 cotton (Gossypium hirsutum), com, tobacco (Nicotiana tabacum), alfalfa 
(Medicago sativa), wheat (Triticum sp), barley (Hordeum vulgare), oats 
(Avena sativa, L), sorghum (Sorghum bicolor), rice (Oryza sativa), 
Arabidopsis, cruciferous vegetables (broccoli, cauliflower, cabbage, 
parsnips, etc.), melons, carrots, celery, parsley, tomatoes, potatoes, 

10 strawberries, peanuts, grapes, grass seed crops, sugar beets, sugar cane, 
beans, peas, rye, flax, hardwood trees, softwood trees, and forage 
grasses. Algal species include but not limited to commercially significant 
hosts such as Spirulina and Dunalliela. Overexpression of the proteins of 
the instant invention may be accomplished by first constructing chimeric 

15 genes in which the coding region are operably linked to promoters capable 
of directing expression of a gene in the desired tissues at the desired 
stage of development. For reasons of convenience, the chimeric genes 
may comprise promoter sequences and translation leader sequences 
derived from the same genes. 3' Non-coding sequences encoding 

20 transcription termination signals must also be provided. The instant 
chimeric genes may also comprise one or more introns in order to 
facilitate gene expression. 

Any combination of any promoter and- any terminator capable of 
inducing expression of a coding region may be used in the chimeric 

25 genetic sequence. Some suitable examples of promoters and terminators 
include those from nopaline synthase (nos), octopine synthase (ocs) and 
cauliflower mosaic virus {CaMV) genes. One type of efficient plant 
promoter that may be used is a high level plant promoter. Such 
promoters, in operable linkage with the genetic sequences or the present 

30 invention should be capable of promoting expression of the present gene 
product. High level plant promoters that may be used in this invention 
include the promoter of the small subunit (ss) of the ribulose-1,5- 
bisphosphate carboxylase from example from soybean (Berry-Lowe etai, 
J. Molecular and App. Gen., 1:483-498 1982)), and the promoter of the 

35 chlorophyll a/b binding protein. These two promoters are known to be 
light-induced in plant cells (See, for example, Genetic Eng ineering of 
Plants, an Agricultural Perspective . A. Cashmore, Plenum, New York 
(1983), pages 29-38; Coruzzi, G. etai, The Journal of Biological 
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Chemistry, 258:1399 (1983), and Dunsmuir, P. etal., Journal of Molecular 
and Applied Genetics, 2:285 (1983)). 

Plasmid vectors comprising the instant chimeric genes can then be 
constructed. The choice of plasmid vector depends upon the method that 

5 will be used to transform host plants. The skilled artisan is well aware of 
the genetic elements that must be present on the plasmid vector in order 
to successfully transform, select and propagate host cells containing the 
chimeric gene. The skilled artisan will also recognize that different 
independent transformation events will result in different levels and 

10 patterns of expression (Jones et a/., EMBO J. 4:241 1 -241 8 (1 985); 
De Almeida et a/., Mol. Gen. Genetics 218:78-86 (1989)), and thus that 
multiple events must be screened in order to obtain lines displaying the 
desired expression level and pattern. Such screening may be 
accomplished by Southern analysis of DNA blots (Southern, J. Mol. Biol. 

15 98:503, (1975)). Northern analysis of mRNA expression (Kroczek, J. 
Chromatogr. Biomed. Appl., 618 (1-2):133-145 (1993)), Western analysis 
of protein expression, or phenotypic analysis. 

For some applications it will be useful to direct the instant proteins 
to different cellular compartments. It is thus envisioned that the chimeric 

20 genes described above may be further supplemented by altering the 
coding sequences to encode enzymes with appropriate intracellular 
targeting sequences such as transit sequences (Keegstra, K., Cell 
56:247-253 (1989)), signal sequences or sequences encoding 
endoplasmic reticulum localization (Chrispeels, J.J., Ann. Rev. Plant Phys. 

25 Plant Mol. Biol. 42:21 -53 (1 991 )), or nuclear localization signals (Raikhel, 
N. Plant Phys. 100: 1627-1 632 (1992)) added and/or with targeting 
sequences that are already present removed. While the references cited 
give examples of each of these, the list is not exhaustive and more 
targeting signals of utility may be discovered in the future that are useful in 

30 the invention. 

Process for the Production of Lactones and Esters from Ketone Substrates 
Once the appropriate nucleic acid sequence has been expressed in a 
recombinant organism, the organism may be contacted with a suitable ketone 
substrate for the production of the corresponding ester. The Baeyer-Villiger 

35 monooxygenases of the instant invention will act on a variety of ketone 
substrates comprising cyclic ketones and ketoterpenes to produce the 
corresponding lactone or ester. Suitable ketone substrates for the conversion 
to esters are defined by the general formula: 
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O 

II 

R C 

wherein R and are independently selected from substituted or 
unsubstituted phenyl, substituted or unsubstituted alkyl, or substituted or 
unsubstituted alkenyl or substituted or unsubstituted alkylidene. 
Particularly useful ketone substrates include, but are not limited to 
Norcamphor, Cyclobutanone, Cyclopentanone, 2-methyl-cyclopentanone, 
Cyclohexanone, 2-methyl-cyclohexanone, Cyclohex-2-ene-1-one, 1,2- 
cyclohexanedione, 1 ,3-cyclohexanedione, 1 ,4-cyclohexanedione, 
Cycloheptanone, Cyclooctanone, Cyclodecanone, Cycloundecanone, 
Cyclododecanone, Cyclotridecanone, Cyclopenta-decanone, 2- 
tridecanone, dihexyl ketone, 2-phenyi-cyclohexanone, Oxindole, 
Levoglucosenone, dimethyl sulfoxide, dimethy-2-piperidone, 
Phenylboronic acid, and beta-ionone. 

Alternatively it is contemplated that the enzymes of the invention 
may be used in vitro for the transformation of ketone substrates to the 
corresponding esters. The monooxygenase enzymes may be produced 
recombinantly or isoalted from native sources, purified and reacted with 
the appropriate substrate under suitalbe conditions of pH and 
temperature. ' 

Where large scale commercial production of lactones or esters is 
desired, a variety of culture methodologies may be applied. For example, 
large scale production from a recombinant microbial host may be 
produced by both batch or continuous culture methodologies. 

A classical batch culturing method is a closed system where the 
composition of the media is set at the beginning of the culture and not 
subject to artificial alterations during the culturing process. Thus, at the 
beginning of the culturing process the media is inoculated with the desired 
organism or organisms and growth or metabolic activity is permitted to 
occur adding nothing to the system. Typically, however, a "batch" culture 
is batch with respect to the addition of carbon source and attempts are 
often made at controlling factors such as pH and oxygen concentration. In 
batch systems the metabolite and biomass compositions of the system 
change constantly up to the time the culture is terminated. Within batch 
cultures cells moderate through a static lag phase to a high growth log 
phase and finally to a stationary phase where growth rate is diminished or 



35 



WO 03/020890 



PCT/US02/27549 



halted. If untreated, cells in the stationary phase will eventually die. Cells 
in log phase are often responsible for the bulk of production of end 
product or intermediate in some systems. Stationary or post-exponential 
phase production can be obtained in other systems. 

5 A variation on the standard batch system is the Fed-Batch system. 

Fed-Batch culture processes are also suitable in the present invention and 
comprise a typical batch system with the exception that the substrate is 
added in increments as the culture progresses. Fed-Batch systems are 
useful when catabolite repression is apt to inhibit the metabolism of the 

10 cells and where it is desirable to have limited amounts of substrate in the 
media. Measurement of the actual substrate concentration in Fed-Batch 
systems is difficult and is therefore estimated on the basis of the changes 
of measurable factors such as pH, dissolved oxygen and the partial 
pressure of waste gases such as C0 2 . Batch and Fed-Batch culturing 

15 methods are common and well known in the art and examples may be 
found in Thomas D. Brock in Biotechnology: A Textbook of Industrial 
Microbiology . Second Edition (1989) Sinauer Associates, Inc., 
Sunderland, MA., or Deshpande, Mukund V., Appl. Biochem. BiotechnoL, 
36, 227, (1992), herein incorporated by reference. 

20 Commercial production of lactones and esters of the present 

invention may also be accomplished with a continuous culture. 
Continuous cultures are an open system where a defined culture media is 
added continuously to a bioreactor and an equal amount of conditioned 
media is removed simultaneously for processing. Continuous cultures 

25 generally maintain the cells at a constant high liquid phase density where 
cells are primarily in log phase growth. Alternatively continuous culture 
may be practiced with immobilized cells where carbon and nutrients are 
continuously added, and valuable products, by-products or waste products 
are continuously removed from the cell mass. Cell immobilization may be 

30 performed using a wide range of solid supports composed of natural 
and/or synthetic materials. 

Continuous or semi-continuous culture allows for the modulation of 
one factor or any number of factors that affect cell growth or end product 
concentration. For example, one method will maintain a limiting nutrient 

35 such as the carbon source or nitrogen level at a fixed rate and allow all 
other parameters to moderate. In other systems a number of factors 
affecting growth can be altered continuously while the cell concentration, 
measured by media turbidity, is kept constant. Continuous systems strive 
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to maintain steady state growth conditions and thus the cell loss due to 
media being drawn off must be balanced against the cell growth rate in 
the culture. Methods of modulating nutrients and growth factors for 
continuous culture processes as well as techniques for maximizing the 

5 rate of product formation are well known in the art of industrial 
microbiology and a variety of methods are detailed by Brock, supra. 
Baever-Villiaer monooxyqenases having enhanced activity 

It is contemplated that the present BVMO sequences may be used 
to produce gene products having enhanced or altered activity. Various 

10 methods are known for mutating a native gene sequence to produce a 
gene product with altered or enhanced activity including but not limited to 
error prone PCR (Melnikov etal., Nucleic Acids Research, (Feb. 15, 1999) 
Vol. 27, No. 4, pp. 1056-1062); site directed mutagenesis (Coombs etal., 
Proteins (1998), 259-311, 1 plate. Editor(s): Angeletti, Ruth Hogue. 

15 Publisher: Academic, San Diego, CA) and "gene shuffling" (US 5,605,793; 
US 5,811,238; US 5,830,721; and US 5,837,458, incorporated herein by 
reference). 

The method of gene shuffling is particularly attractive due to its 
facile implementation, and high rate of mutagenesis and ease of 

20 screening. The process of gene shuffling involves the restriction 

endonuclease cleavage of a gene of interest into fragments of specific 
size in the presence of additional populations of DNA regions of both 
similarity to or difference to the gene of interest. This pool of fragments 
will then be denatured and reannealed to create a mutated gene. The 

25 mutated gene is then screened for altered activity. 

The BVMO sequences of the present invention may be mutated 
and screened for altered or enhanced activity by this method. The 
sequences should be double stranded and can be of various lengths 
ranging form 50 bp to 10 kb. The sequences may be randomly digested 

30 into fragments ranging from about 1 0 bp to 1 000 bp, using restriction 
endonucleases well known in the art (Maniatis supra). In addition to the 
instant microbial sequences, populations of fragments that are 
hybridizable to all or portions of the microbial sequence may be added. 
Similarly, a population of fragments which are not hybridizable to the 

35 instant sequence may also be added. Typically these additional fragment 
populations are added in about a 10 to 20 fold excess by weight as 
compared to the total nucleic acid. Generally if this process is followed 
the number of different specific nucleic acid fragments in the mixture will 
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be about 100 to about 1000. The mixed population of random nucleic 
acid fragments are denatured to form single-stranded nucleic acid 
fragments and then reannealed. Only those single-stranded nucleic acid 
fragments having regions of homology with other single-stranded nucleic 
acid fragments will reanneal. The random nucleic acid fragments may be 
denatured by heating. One skilled in the art could determine the 
conditions necessary to completely denature the double stranded nucleic 
acid. Preferably the temperature is from 80°C to 100°C. The nucleic acid 
fragments may be reannealed by cooling. Preferably the temperature is 
from 20°C to 75°C. Renaturation can be accelerated by the addition of 
polyethylene glycol ("PEG") or salt. A suitable salt concentration may 
range from 0 mM to 200 mM. The annealed nucleic acid fragments are 
then incubated in the presence of a nucleic acid polymerase and dNTP's 
(i.e. dATP, dCTP, dGTP and dTTP). The nucleic acid polymerase may be 
the Klenow fragment, the Taq polymerase or any other DNA polymerase 
known in the art. The polymerase may be added to the random nucleic 
acid fragments prior to annealing, simultaneously with annealing or after 
annealing. The cycle of denaturation, renaturation and incubation in the 
presence of polymerase is repeated for a desired number of times. 
Preferably the cycle is repeated from 2 to 50 times, more preferably the 
sequence is repeated from 10 to 40 times. The resulting nucleic acid is a 
larger double : stranded polynucleotide ranging from about 50 bp to about 
100 kb and may be screened for expression and altered activity by 
standard cloning and expression protocol. (Manatis supra). 

Furthermore, a hybrid protein can be assembled by fusion of 
functional domains using the gene shuffling (exon shuffling) method 
(Nixon era/, PNAS, 94:1069-1073 (1997)). The functional domain of the 
instant gene can be combined with the functional domain of other genes 
to create novel enzymes with desired catalytic function. A hybrid enzyme 
may be constructed using PCR overlap extension method and cloned into 
the various expression vectors using the techniques well known to those 
skilled in art. 

EXAMPLES 

The present invention is further defined in the following Examples. 
It should be understood that these Examples, while indicating preferred 
embodiments of the invention, are given by way of illustration only. From 
the above discussion and these Examples, one skilled in the art can 
ascertain the essential characteristics of this invention, and without 
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departing from the spirit and scope thereof, can make various changes 
and modifications of the invention to adapt it to various usages and 
conditions. 

GENERAL METHODS 

5 Standard recombinant DNA and molecular cloning techniques used 

in the Examples are well known in the art and are described by Sambrook, 
J., Fritsch, E. F. and Maniatis, T. Molecular Cloning: A Laboratory 
Manual; Cold Spring Harbor Laboratory Press: Cold Spring Harbor, 
(1989) (Maniatis) and byT. J. Silhavy, M. L Bennan, and L. W. Enquist, 

10 Experiments with Gene Fusions, Cold Spring Harbor Laboratory, Cold 
Spring Harbor, N.Y. (1984) and by Ausubel, F. M. et a/., Current Protocols 
in Molecular Biology, pub. by Greene Publishing Assoc. and Wiley- 
Interscience (1987). 

Materials and methods suitable for the maintenance and growth of 

15 bacterial cultures are well known in the art. Techniques suitable for use in 
the following examples may be found as set out in Manual of Methods for 
General Bacteriology (Phillipp Gerhardt, R. G. E. Murray, Ralph N. 
Costilow, Eugene W. Nester, Willis A. Wood, Noel R. Krieg and G. Briggs 
Phillips, Eds., American Society for Microbiology, Washington, DC. 

20 (1 994)) or by Thomas D. Brock in Biotechnology: A Textbook of Industrial 
Microbiology . Second Ed., Sinauer Associates, Inc.: Sunderland, MA 
(1989). All reagents, restriction enzymes and materials used for the 
growth and maintenance of bacterial cells were obtained from Aldrich 
Chemicals (Milwaukee, Wl), DIFCO Laboratories (Detroit, Ml), 

25 GIBCO/BRL (Gaithersburg, MD), or Sigma Chemical Company (St. Louis, 
MO) unless otherwise specified. 

Bacterial Strains and Plasmids : Rhodococcus erythropolis AN 12, 
Brevibacterium sp. HCU, Arthrobacter sp. BP2, Rhodococcus sp. phil, 
Rhodococcus sp. phi2, Acidovorax sp. CHX, and Acinetobacter sp. SE19 

30 were isolated from enrichment of activated sludge obtained from industrial 
wastewater treatment facilities. Max Efficiency competent cells of E. coli 
DH5a and DH10B were purchased from GIBCO/BRL (Gaithersburg, MD). 
Expression plasmid pQE30 were purchased from Qiagen (Valencia, CA), 
while cloning vector pCR2.1 and expression vector pTrc/His2-Topo were 

35 purchased from Invitrogen (San Diego, CA). 

Taxonomic identification of Rhodococcus erythropolis AN 12, 
Brevibacterium sp. HCU, Arthrobacter sp. BP2, Rhodococcus sp. phil, 
Rhodococcus sp. phi2, Acidovorax sp. CHX, and Acinetobacter sp. SE19 
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was performed by PCR amplification of 16S rDNA from chromosomal 
DNA using primers corresponding to conserved regions of the 16S rDNA 
molecule (Table 2). The following temperature program was used: 95°C 
(5 min) for 1 cycle followed by 25 cycles of: 95°C (1 min), 55°C (1 min), 
72°C (1 min), followed by a final extension at 72°C (8 min). Following 
DNA sequencing (according to the method shown below), the 16S rDNA 
gene sequence of each isolate was used as the query sequence for a 
BLAST search (Altschul, et a/., Nucleic Acids Res. 25:3389-3402 (1997)) 
against GenBank for similar sequences. 



Table 2 



SEQID 
NO 


Primer Sequence (5 - 3') 


Reference 


50 


GAGTTTGATCCTGGCTC 
AG 


(HK12) Amann, R.I. et al. Microbiol. 
Rev. 59(1 ):143-69 (1995) 


51 


CAGG(A/C)GCCGCGGTA 
AT(A/T)C 


Amann, R.I. et al. Microbiol. Rev. 
59(1 ):1 43-69 (1995) 


52 


GCTGCCTCCCGTAGGA 
GT 


(HK21) Amann, R.I. et al. Microbiol. 
Rev. 59(1):143-69 (1995) 


53 


CTACCAGGGTAACTAAT 
CC 


Amann, R.I. et al. Microbiol. Rev. 
59(1 ):1 43-69 (1995) 


54 


ACGGGCGGTGTGTAC 


Amann, R.I. etal. Microbiol. Rev. 
59(1 ):1 43-69 (1995) 


55 


CACGAGCTGACGACAG 
CCAT 


Amann, R.I. et al. Microbiol. Rev. 
59(1 ):143-69 (1995) 


56 


TACCTTGTTACGACTT 


(HK13) Amann, R.I. et al. Microbiol. 
Rev. 59(1):143-69 (1995) 


57 


G(A/T)ATTACCGCGGC( 
G/T)GCTG 


Amann, R.I. et al. Microbiol. Rev. 
59(1):143-69 (1995) 


58 


GGATTAGATACCCTGGT 
AG 


Amann, R.I. et al. Microbiol. Rev. 
59(1 ):1 43-69 (1995) 


59 


ATGGCTGTCGTCAGCT 
CGTG 


Amann, R.I. et al. Microbiol. Rev. 
59(1 ):1 43-69 (1995) 


60 


GCCCCCG(C/r)CAATTC 
CT 


(HK15) Kane, M.D. et al. Appl. 
Environ. Microbiol. 59:682-686 
(1993) 
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SEQ ID 
NO 


Primer Sequence (5- 3') 


Reference 


61 


GTGCCAGCAG(OT)(A/C) 
GCGGT 


(HK14) Kane, M.D. et al. Appl. 
Environ. Microbiol. 59:682-686 
(1993) 


62 


GCCAGCAGCCGCGGTA 


(JCR15) Kane, M.D. et al. Appl. 
Environ. Microbiol. 59:682-686 
(1993) 



Note: Parenthetical information in bold is the original name for the primer, according to the 
reference provided. 



Sequencing 

5 Sequence was generated on an ABI Automatic sequencer using 

dye terminator technology (U.S. Patent 5,366,860; EP 272007) using a 
combination of vector and insert-specific primers. Sequence editing was 
performed using either Sequencher (Gene Codes Corp., Ann Arbor, Ml), or 
the Wisconsin GCG program (Wisconsin Package Version 9.0, Genetics 
10 Computer Group (GCG), Madison, Wl) and the CONSED package 

(version 7.0). All sequences represent coverage at least two times in both 
directions. 

Manipulations of genetic sequences were accomplished using the 
suite of programs available from the Genetics Computer Group Inc. 

15 (Wisconsin Package Version 9.0, Genetics Computer Group (GCG), 
Madison, Wl). Where the GCG program "Pileup" was used, the gap 
creation default value of 12 and the gap extension default value of 4 were 
used. Where the GCG "Gap" or "Bestfif programs were used, the default 
gap creation penalty of 50 and the default gap extension penalty of 3 were 

20 used. In any case where GCG program parameters were not prompted 
for, in these or any other GCG program, default values were used. 

The meaning of abbreviations is as follows: "sec" means 
second(s), "min" means minute(s), "h" means hour(s), "d" means day(s), 
"uL n means microliter, "ml_" means milliliters, "L" means liters, "uM" means 

25 micromolar, "mM" means millimolar, "M" means molar, "mmol" means 
millimole(s), "umole" mean micromole", "g" means gram, "pg" means 
microgram, "ng" means nanogram, "IT means units, "mil" means 
milliunits, "ppm" means parts per million, "psi" means pounds per square 
inch, and "kB" means kilobase. 
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EXAMPLE 1 

Monooxygenase Gene Discovery in a Mixed Microbial Population 
This Example describes the isolation of the cyclohexanone 
degrading organisms Arthrobacter sp. BP2, Rhodococcus sp. phil, and 

5 Rhodococcus sp. phi2 by enrichment of a mixed microbial community. 
Differential display techniques applied to cultures containing the mixed 
microbial population permitted discovery of monooxygenase genes. 
Enrichment for cyclohexanone dearaders 

A mixed microbial community was obtained from a wastewater 

10 bioreactor and maintained on minimal medium (50 mM KHP0 4 (pH 7.0), 
10 mM (NH 4 )S0 4 , 2 mM MgCI 2 , 0.7 mM CaCI 2 , 50 uM MnCI 2 , 1 uM 
FeCI 3 , 1 uM ZnCI 3 , 1.72 uM CuS0 4 , 2.53 uM CoCI 2 , 2.42 uM Na 2 Mo0 2 , 
and 0.0001% FeS0 4 ) with trace amounts of yeast extract casamino acids 
and peptone (YECAAP) at 0.1% concentration with 0.1% cyclohexanol 

15 and cyclohexanone added as carbon sources. Increased culture growth in 
the presence of cyclohexanone indicated a microbial population with 
members that could convert cyclohexanone. 
Isolation of Strains 

Seven individual strains were isolated from the community by 

20 spreading culture on R2A Agar (Becton Dickinson and Company, 

Cockeysville, MD) at 30° C. Strains were streaked to purity on the same 
medium. Among these seven strains, the strain identified as Arthrobacter 
species BP2 formed large colonies of a light yellow color. One 
Rhodococcus strain, identified as species phil, formed small colonies that 

25 were orange in color. The other Rhodococcus strain, designated species 
phi2, formed small colonies that were red in color. 

Individuals strains were identified by comparing 16s rDNA 
sequences to known 16S rRNA sequences in the GenBank sequence 
database. The 16S rRNA gene sequence from strain BP2 (SEQ ID NO:1) 

30 was at least 99% homologous to the 16S rRNA gene sequences of 
bacteria belonging to the genus Arthrobacter. The 16S rRNA gene 
sequences from strains phil and phi2 were each at least 99% 
homologous to the 16S rRNA gene sequences of bacteria belonging to 
the genus of gram positive bacteria, Rhodococcus. The complete 16s 

35 DNA sequence of Rhodococcus sp. phil is shown as SEQ ID NO:2, while 
that of Rhodococcus sp. phi2 is listed as SEQ ID NO:3. 
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Induction of cyclohexanone oxidation genes 

For induction of cyclohexanone oxidation genes within members of 
this community, 1 ml of inoculum from a waste water bioreactor was 
suspended in 25 ml minimal medium with 0.1% YECAAP and incubated 

5 overnight at 30°C with agitation. The next day 1 0 ml of the overnight 
culture was resuspended in a total volume of 50 ml minimal medium with 
0.1 % YECAAP. The optical density of the culture was 0.29 absorbance 
units at 600 nm. After equilibration at 30°C for 30 min, the culture was 
split into two separate 25 ml volumes. To one of these cultures, 25 pi 

10 (0.1 %) cyclohexanone (Sigma-AIdrich, St. Louis, MO) was added. Both 
cultures were incubated for an additional 3 hrs. At this time, cultures were 
moved onto ice, harvested by centrifugation at 4°C, washed with two 
volumes of minimal salts medium and diluted to an optical density of 1 .0 
absorbance unit (600 nm). Approximately 6 ml of culture was placed in a 

15 water jacketed respirometry cell equipped with an oxygen electrode 

(Yellow Springs Instruments Co., Yellow Springs, OH) at 30°C to confirm 
cyclohexanone enzymes were induced. After establishing the baseline 
respiration for each cell suspension, cyclohexanone was added to a final 
concentration of 0.1 % and the rate of 0 2 consumption was further 

20 monitored. For the control culture, 2 mM potassium acetate was added 
200 sec after the cyclohexanone. 
Isolation of total community RNA 

After the 3 hr induction period with cyclohexanone described 
above, the control and induced sample (2 mL each) were harvested at 

25 1 400 rpm in a 4 °C centrifuge and resuspended in 900 pi Buffer RLT 
(Qiagen, Valencia, CA). A 300 pi volume of zirconia beads (Biospec 
Products, Bartlesville, OK) was added and cells were disrupted using a 
bead beater (Biospec Products) at 2400 beats per min for 3 min. Each of 
these samples was split into six aliquots for nucleic acid isolation using the 

30 RNeasy Mini Kit (Qiagen, Valencia, CA) and each was eluted with 100 
RNase-free dH 2 0 supplied with the kit. DNA was degraded in the 
samples using 10 mM MgCI 2 , 60 mM KCI and 2 U RNase-free DNase I 
(Ambion, Austin, TX) at 37 °C for 4 hr. Following testing for total DNA 
degradation by PCR using one of the arbitrary oligonucleotides used for 

35 RT-PCR, RNA was purified using the RNeasy Mini Kit and eluted in 1 00 pi 
RNase-free dH 2 0 as described previously. 
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Generation of RAPDs from arbitrarily reverse-transcribed total RNA 

A set of 244 primers with the sequence CGGAGCAGATCGAWW 
(SEQ ID NO:63); where WW represent all the combinations of the three 
bases A, G and C) was used in separate RT-PCR reactions as with RNA 
5 from either the control or induced cells. The Superscript™ One-Step™ 
RT-PCR System (Life Technologies Gibco BRL, Rockville, MD) reaction 
mixture was used with 2-5 ng of total RNA in a 25 pi total reaction volume. 
The PCR was conducted using the following temperature program: 

1 cycle: 4 °C (2 min), 5 min ramp to 37 °C (1 hr), followed by 95 °C 
10 incubation (3 min); 

1 cycle: 94 °C (1 min), 40 °C (5 min), and 72 °C (5 min); 

40 cycles: 94 °C (1 min), 60 °C (1 min), and 72 °C (1 min); 

1 cycle: 70 °C (5 min) and 4 °C hold until separated by 
electorphoresis. 

1 5 Products of these PCR amplifications (essentially RAPD fragments) 

were separated by electrophoresis at 1 V/cm on polyacrylamide gels 
(Amersham Pharmacia Biotech, Piscataway, NJ). Products resulting from 
the control mRNA (no cyclohexanone induction) and induced mRNA 
fragments were visualized by silver staining using an automated gel 

20 stainer (Amersham Pharmacia Biotech, Piscataway, NJ). 
Reamplification of differentially expressed DNA fragments 

A 25 pi volume of a sodium cyanide elution buffer (10mg/ml NaCN, 
20 mM Tris-HCI (pH 8.0), 50 mM KCI and 0.05% NP40) was incubated 
with an excised gel band of a differentially display fragment at 95°C for 20 

25 min. Reamplification of this DNA fragment was achieved in a PCR 
reaction using 5 pi of the elution mixture in a 25 pi reaction using the 
primer from which the fragment was originally generated. The 
temperature program for reamplification was: 94 °C (5 min); 20 cycles of 
94 °C (1 min), 55 °C (1 min), and 72 °C (1 min); followed by 72 °C (7 min). 

30 The reamplification products were directly cloned into the pCR2.1 -TOPO 
vector (Invitrogen, Carlsbad, CA) and were sequenced using an ABI 
model 377 with ABI BigDye terminator sequencing chemistry (Perseptive 
Biosystems, Framinham, MA). Eight clones were submitted for 
sequencing from each reamplified band. The nucleotide sequence of the 

35 cloned fragments was compared against the non-redundant GenBank 
database using the BlastX program (NCBI). 



44 



WO 03/020890 



PCT/US02/27549 



Sequencing of cvclohexanone oxidation pathway genes 

Oligonucleotides were designed to amplify by PCR individual 
differentially expressed fragments. Following DNA isolation from 
individual strains, these oligonucleotide primers were used to determine 

5 which strain contained DNA encoding the individual differentially 
expressed fragments. Cosmids were screened by PCR using primers 
designed against differentially displayed fragments with homology to 
known cyclohexanone degradation genes. Each recombinant E. coli cell 
culture carrying a cosmid clone (1.0 ul) was used as the template in a 25 

10 ul PCR reaction mixture. The primer pair A102FI (SEQ ID NO: 108) and 
CONR (SEQ ID NO:109) was used to screen the Arthrobacter sp. BP2 
library, primer pair A228FI (SEQ ID NO:110) and A228RI (SEQ ID 
NO:1 1 1 ) was used to screen the Rhodococcus sp. phi2 library, and the 
primer pair of A2FI (SEQ IDNO:112) andA34RI (SEQ ID NO: 11 3) was 

15 used to screen the Rhodococcus sp. phil library. Cosmids from 
recombinant E. coli which produced the correct product size in PCR 
reactions were isolated, digested partially with Sau3AI and 1 0-1 5 kB 
fragments from this partial digest were sub-cloned into the blue/white 
screening vector pSU19 (Bartolome, B. et al. Gene. 102(1): 75-8 (Jun 15, 

20 1 991 ); Martinez, E. et al. Gene. 68(1 ): 1 59-62 (Aug 1 5, 1 988)). These 
sub-clones were isolated using Qiagen Turbo96 Miniprep kits and re- 
screened by PCR as previously described. Sub-clones carrying the 
correct sequence fragment were transposed with pGPS1 .1 using the 
GPS-1 Genome Priming System kit (New England Biolabs, Inc., Beverly, 

25 MA). A number of these transposed plasmids were sequenced from each 
end of the transposon to obtain kilobase long DNA fragments. Sequence 
assembly was performed with the Sequencher program (Gene Codes 
Corp., Ann Arbor Ml). 

EXAMPLE 2 

30 Isolation of Brevibacterium sp. HCU Monooxvqenase Genes 

Involved In The Oxidation Of Cvclohexanone 
This Example describes the isolation of the cyclohexanol and 
cyclohexanone degrader Brevibacterium sp. HCU. Discovery of BV 
monooxygenase genes from the organism was accomplished using 
35 differential display methods. 
Strain Isolation 

Selection for a halotolerant bacterium degrading cyclohexanol and 
cyclohexanone was performed on agar plates of a halophilic minimal 
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medium (Per liter: 15 g Agar, 100 g NaCI, 10 g MgS0 4 , 2 g KCI, 1 g 
NH 4 CI, 50 mg KH 2 P0 4 , 2 mg FeS0 4 , 8 g, Tris-HCI (pH 7)) containing 
traces of yeast extract and casaminoacids (0.005% each) and incubated 
under vapors of cyclohexanone at 30°C. The inoculum was a 
5 resuspension of sludge from industrial wastewater treatment plant. After 
two weeks, beige colonies were observed and streaked to purity on fresh 
agar plates grown under the same conditions. 

The complete 16s DNA sequence of the isolated Brevibacterium 
sp. HCU was found to be unique and is shown as SEQ ID NO:4. 

10 Comparison to other 1 6S rRNA sequences in the GenBank sequence 
database found the 16S rRNA gene sequence from strain HCU was at 
least 99% homologous to the 16S rRNA gene sequences of bacteria 
belonging to the genus Brevibacterium. 
Induction of the Cyclohexanone Degradation Pathway 

15 Induciblity of the cyclohexanone pathway was tested by 

respirometry in low salt medium. One colony of Brevibacterium sp. HCU 
was inoculated in 300 ml of S12 mineral medium (50 mM KHP0 4 buffer 
(pH 7.0), 10 mM (NH4)2S0 4 , 2 mM MgCI 2 , 0.7 mM CaCI 2 , 50 uM MnCI 2 , 
1 uM FeCI 3 , 1 uM ZnCI 3 , 1.72 uM CuS0 4 , 2.53 uM CoCI 2 , 2.42 u.M 

20 Na 2 Mo0 2 , and 0.0001 % FeS0 4 ) containing 0.005% yeast extract. The 
culture was then split into two flasks which received respectively 10 mM 
acetate and 10 mM cyclohexanone. Each flask was incubated for 6 hrs at 
30°C to allow for the induction of the cyclohexanone degradation genes. 
The cultures were then chilled on iced, harvested by centrifugation and 

25 washed three times with ice-cold S1 2 medium lacking traces of yeast 
extract. Cells were finally resuspended to an optical density of 2.0 at 
600 nm and kept on ice until assayed. 

Half a ml of each culture was placed in a water jacketed 
respirometry cell equipped with an oxygen electrode (Yellow Spring 

30 Instruments Co., Yellow spring, OH) and containing 5 ml of air saturated 
S12 medium at 30°C. After establishing the baseline respiration for each 
of the cell suspensions, acetate or cyclohexanone was added to a final 
concentration of 0.02% and the rate of 0 2 consumption was further 
monitored. 

35 Identification of Cyclohexanone Oxidation Genes 

Identification of genes involved in the oxidation of cyclohexanone 
made use of the fact that this oxidation pathway is inducible. The mRNA 
populations of a control culture and a cyclohexanone-induced culture were 
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compared using a technique based on the random amplification of DNA 
fragments by reverse transcription followed by PCR. 
Isolation of Total Cellular RNA 

The cyclohexanone oxidation pathway was induced by addition of 
5 0.1% cyclohexanone into one of two "split" 10 ml cultures of 

Brevibacterium sp. HCU grown in S12 medium. Each culture was chilled 
rapidly in an ice-water bath and transferred to a 15 ml tube. Cells were 
collected by centrifugation for 2 min at 12,000 x g in a rotor chilled to -4°C. 
The supernatants were discarded, the pellets resuspended in 0.7 ml of 
10 ice-cold solution of 1 % SDS and 100 mM sodium acetate at pH 5 and 
transferred to a 2 ml tube containing 0.7 ml of aqueous phenol pH 5 and 
0.3 ml of 0.5 mm zirconia beads (Biospec Products, Bartlesville, OK). The 
tubes were placed in a bead beater (Biospec) and disrupted at 
2,400 beats per min for two min. 
15 Following the disruption of the cells, the liquid phases of the tubes 

were transferred to new microfuge tubes and the phases separated by 
centrifugation for 3 min at 1 5,000 x g. The aqueous phase containing total 
RNA was extracted twice more with phenol at pH 5 and twice with a 
mixture of phenol/chloroform/isoamyl alcohol pH 7.5 until a precipitate was 
20 no longer visible at the phenol/water interface. Nucleic acids were then 
recovered from the aqueous phase by ethanol precipitation with three 
volumes of ethanol and the pellet resuspended in 0.5 ml of diethyl 
pyrocarbonate (DEPC) treated water. DNA was digested by 6 units of 
RNAse-free DNAse (Boehringer Mannheim, Indianapolis, IN) for 1 hr at 
25 37°C. The total RNA solution was then extracted twice with 
phenol/chloroform/isoamyl alcohol pH 7.5, recovered by ethanol 
precipitation and resuspended in 1 ml of DEPC treated water to an 
approximate concentration of 0.5 mg per ml. 

Generation of RAPDs Patterns From Arbitrarily Reverse- 
30 Transcribed Total RNA 

Arbitrarily amplified DNA fragments were generated from the total 
RNA of control and induced cells by following the protocol described by 
Wong K.K. ef a/. (Proc Natl Acad Sci USA.M :639 (1 994)). A series of 
parallel reverse transcription (RT)/PCR amplification experiments were 
35 performed using a RT-PCR oligonucleotide set. This set consisted of 
81 primers, each designed with the sequence CGGAGCAGATCGAWW 
(SEQ ID NO:63) where WW represent all the combinations of the three 
bases A, G and C at the last four positions of the 3'-end. 
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The series of parallel RT-PCR amplification experiments were 
performed on the total RNA from the control and induced cells, each using 
a single RT-PCR oligonucleotide. Briefly, 50 ul reverse transcription (RT) 
reactions were performed on 20-100 ng of total RNA using 100 U 
5 Moloney Murine Leukemia Virus (MMLV) reverse transcriptase (Promega, 
Madison, Wl) with 0.5 mM of each dNTP and 1 mM for each 
oligonucleotide primer. Reactions were prepared on ice and incubated at 
37°C for 1 hr. 

Five pJ from each RT reaction were then used as template in a 

10 50 ul PCR reaction containing the same primer used for the RT reaction 
(0.25 \M), dNTPs (0.2 mM each), magnesium acetate (4 mM) and 2.5 U 
of the Taq DNA polymerase Stoffel fragment (Perkin Elmer, Foster City, 
CA). The following temperature program was used: 94°C (5 min), 40°C 
(5 min), 72°C (5 min) for 1 cycle followed by 40 cycles of 94°C (1 min), 

15 60°C (1 min), 72°C (5 min). 

RAPD fragments were separated by electrophoresis on acrylamide 
gels (15 cm x 15 cm x 1.5 mm, 6% acrylamide, 29:1 acryhbisacrylamide, 
100 mM Tris, 90 mM borate, 1 mM EDTA pH 8.3). Five ^il from each PCR 
reaction were analyzed with the reactions from the control and the induced 

20 RNA for each primer running side by side. Electrophoresis was performed 
at 1 V/cm. DNA fragments were visualized by silver staining using the 
Plus One® DNA silver staining kit in the Hoefer automated gel stainer 
(Amersham Pharmacia Biotech, Piscataway, NJ). 

Reamplification of the Differentially Expressed DNA 

25 Stained gels were rinsed extensively for one hr with distilled water. 

Bands generated from the RNA of cyclohexanone induced cells but 
absent in the reaction from the RNA of control cells were excised from the 
gel and placed in a tube containing 50 ul of 10 mM KCI and 10 mM 
Tris-HCI (pH 8.3) and heated to 95°C for 1 hrto allow some of the DNA to 

30 diffuse out of the gel. Serial dilutions of the eluate over a 200 fold range 
were used as template for a new PCR reaction using the Taq polymerase. 
The primer used for each reamplification (0.25 uM) was the one that had 
generated the pattern. 

Each reamplified fragment was cloned into the blue/white cloning 

35 vector pCR2.1 (Invitrogen, San Diego, CA) and sequenced using the 
universal forward and reverse primers (M13 Reverse Primer (SEQ ID 
NO:64) and M13 (-20) Forward Primer (SEQ ID NO:65). 
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Extension of monooxvaenase fragments bv Out-PCR. 
Kilobase-long DNA fragments extending the sequences fragments 
identified by differential display were generated by "Out-PCR", a PCR 
technique using an arbitrary primer in addition to a sequence specific 

5 primer. The first step of this PCR-based gene walking technique 

consisted of randomly copying the chromosomal DNA using a primer of 
arbitrary sequence in a single round of amplification under low stringency 
conditions. The primers used for Out-PCR were chosen from a primer set 
used for mRNA differential display and their sequences were 

10 CGGAGCAGATCGAVWV (SEQ ID NO:63) where WW was A, G or C. 
Ten Out-PCR reactions were performed, each using one primer of 
arbitrary sequence. The reactions (50 pi) included a 1X concentration of 
the rTth XL buffer provided by the manufacturer (Perkin-Elmer, Foster 
City, CA), 1.2 mM magnesium acetate, 0.2 mM of each dNTP, 10-100 ng 

15 genomic DNA, 0.4 mM of one arbitrary primer and 1 unit of rTth XL 
polymerase (Perkin-Elmer). A five min annealing (45°C) and 15 min 
extension cycle (72°C) lead to the copying of the genomic DNA at arbitrary 
sites and the incorporation of a primer of arbitrary but known sequence at 
the 3' end. 

20 After these initial low stringency annealing and replication steps, 

each reaction was split into two tubes. One tube received a specific 
primer (0.4 mM) designed against the end of the sequence to be extended 
and directed outward, while the second tube received water and was used 
as a control. Thirty additional PCR cycles were performed under higher 

25 stringency conditions with denaturization at 94°C (1 min), annealing at 
60°C (0.5 min) and extension at 72°C (10 min). The long extension time 
was designed to allow for the synthesis of long DNA fragments by the long 
range rTth XL DNA polymerase. The products of each pair of reactions 
were analyzed in adjacent lanes on an agarose gel. 

30 Bands present in the sample having received the specific primer 

but not in the control sample were excised from the agarose gel, melted in 
0.5 ml H 2 0 and used as the template in a new set of PCR reactions. A 1X 
concentration of rTth XL buffer, 1.2 mM magnesium acetate, 0.2 mM of 
each dNTP, 0.4 mM of primers, 1/1000 dilution of the melted slice and 

35 1 unit of rTth XL polymerase were used for these reactions. The PCR was 
performed at 94°C (1 min), 60°C (0.5 min), and 72°C (15 min) per cycle 
for 20 cycles. For each of these reamplification reactions, two control 
reactions, lacking either the arbitrary primer or the specific primer, were 
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included in order to confirm that the reamplification of the band of interest 
required both the specific and arbitrary primer. DNA fragments that 
required both the specific and arbitrary primer for amplification were 
sequenced. For sequencing, the long fragments obtained by Out-PCR 
5 were partially digested with Mbo\ and cloned into pCR2.1 (Invitrogen, 
Carlsbad, CA). Sequences for these partial fragments were obtained 
using primers designed against the vector sequence. 

EXAMPLE 3 

Isolation of a Acidovorax sd. CHX Monooxvaenase Gene Involved in 
10 Degradation of Cvclohexane 

This Example describes the isolation of the cyclohexane degrader 
Acidovorax sp. CHX. Discovery of a BVMO gene was accomplished using 
differential display methods. 
Strain Isolation 

15 An enrichment for bacteria growing on cyclohexane as a sole 

carbon source was started by adding 5 ml of an industrial wastewater 
sludge to 20 ml of mineral medium (50 mM KHP0 4 (pH 7.0), 10 mM 
(NH 4 )S0 4 , 2 mM MgCI 2 , 0.7 mM CaCI 2 , 50 pM MnCI 2 , 1 pM FeCI 3> 1 uM 
ZnCI 3 , 1.72 pM CuS0 4 , 2.53 pM CoCI 2 , 2.42 pM Na 2 Mo0 2 , and 0.0001% 

20 FeS0 4 ) in a 1 25 ml Erlenmeyer flask sealed with a Teflon lined screw cap. 
A test tube containing 1 ml of a mixture of mineral oil and cyclohexane 
(8/1 v/v) was fitted in the flask to provide a low vapor pressure of 
cyclohexane (approximately 30% of the vapor pressure of pure 
cyclohexane). The enrichment was incubated at 30°C for a week. 

25 Periodically, 1 to 10 dilutions of the enrichment were performed in the 
same mineral medium supplemented with 0.005% of yeast extract under 
low cyclohexane vapors. After several transfers, white flocks could be 
seen in the enrichments under cyclohexane vapors. If cyclohexane was 
omitted, the flocks did not grow. 

30 After several transfers, the flocks could be grown with 4 ul of liquid 

cyclohexanone added directly to 10 ml of medium. To isolate colonies, 
flocks were washed in medium and disrupted by thorough shaking in a 
bead beater. The cells released from the disrupted flocks were streaked 
onto R2A medium agar plates and incubated under cyclohexane vapors. 

35 Pinpoint colonies were picked under a dissecting microscope and 

inoculated in 10 ml of mineral medium supplemented with 0.01% yeast 
extract and 4 pi of cyclohexane. The flocks were grown, disrupted and 
streaked again until a pure culture was obtained. 
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Taxonomic identification of this isolate was performed by PCR 
amplification of 16S rDNA, as described in the General Methods. The 
16S rRNA gene sequence from strain CHX was at least 98% homologous 
to the 16S rRNA gene sequence of an uncultured bacterium (Seq. 

5 Accession number AF1 43840) and 95% homologous to the 1 6s rRNA 
gene sequences of the genus Acidovorax termperans (Accession number 
AF078766). The complete 16s DNA sequence of the isolated Acidovorax 
sp. CHX is shown as SEQ ID NO:5. 
Induction of Cvclohexane Degradation Genes 

10 For induction of cyclohexane degradation genes, colonies of 

Acidovorax sp. CHX were scraped from an R2A agar plate and inoculated 
into 25 ml R2A broth. This culture was incubated overnight at 30°C. The 
next day 25 ml of fresh R2A broth was added and growth was continued 
for 15 min. The culture was split into two separate flasks, each of which 

15 received 25 ml. To one of these flasks, 5 pi of pure cyclohexane was 

added to induce expression of cyclohexane degradation genes. The other 
flask was kept as a control. Differential display was used to identify the 
Acidovorax sp. CHX monooxygenase gene. Identification of cyclohexane 
induced gene sequences and sequencing cyclohexanone oxidation genes 

20 from strains was performed in a similar manner as described in Example 
1. 

EXAMPLE 4 

Isolation of a Acinetobacter six. SE19 Monooxvaenase G ene Involved in 
Degradation of Cvclohexanol 
25 This Example describes the isolation of the cyclohexanol degrader 

Acinetobacter sp. SE1 9. Discovery of a BV monooxygenase gene was 
accomplished by screening of cosmid libraries, followed by sequencing of 
shot-gun libraries. 
Isolation of Strain 

30 An enrichment for bacteria that grow on cyclohexanol was isolated 

from a cyclopentanol enrichment culture. The enrichment culture was 
established by inoculating 1 mL of activated sludge into 20 mL of S12 
medium (10 mM ammonium sulfate, 50 mM potassium phosphate buffer 
(pH 7.0). 2 mM MgCI 2 , 0.7 mM CaCI 2 , 50 uM MnCI 2 , 1 uM FeCI 3 , 1 uM 

35 ZnCI 3 , 1 .72 uM CuS0 4 , 2.53 uM CoCI 2 , 2.42 uM Na 2 Mo0 2 , and 0.0001 % 
FeS0 4 ) in a sealed 125 mL screw-cap Erlenmeyer flask. The enrichment 
culture was supplemented with 100 ppm cyclopentanol added directly to 
the culture medium and was incubated at 35°C with reciprocal shaking. 
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The enrichment culture was maintained by adding 100 ppm cyclopentanol 
every 2-3 days. The culture was diluted every 2-10 days by replacing 
10 ml_ of the culture with the same volume of S12 medium. After 15 days 
of incubation, serial dilutions of the enrichment culture were spread onto 

5 LB plates. Single colonies were screened for the ability to grow on S1 2 
liquid with cyclohexanol as the sole carbon and energy source. The 
cultures were grown at 35°C in sealed tubes. One of the isolates, strain 
SE19 was selected for further characterization. 

The 16s rRNA genes of SE19 isolates were amplified by PCR 

10 according to the procedures of the General Methods. Result from all 
isolates showed that strain SE19 has close homology to Acinetobacter 
haemolyticus and Acinetobacter junii, (99% nucleotide identity to each). 
Construction Of /Ac/netobacterCosmid Libraries 

Acinetobacter sp. SE19 was grown in 25 ml LB medium for 6 h at 

15 37"C with aeration. Bacterial cells were centrifuged at 6,000 rpm for 
10 min in a Sorvall RC5C centrifuge at4°C. Supernatant was decanted 
and the cell pellet was frozen at -80X. Chromosomal DNA was prepared 
as outlined below with special care taken to avoid shearing of DNA. The 
cell pellet was gently resuspended in 5 ml of 50 mM Tris-10 mM EDTA 

20 (pH 8) and lysozyme was added to a final concentration of 2 mg/ml. The 
suspension was incubated at 37°C for 1 h. Sodium dodecyl sulfate was 
then added to a final concentration of 1% and proteinase K was added at 
100ug/ml. The suspension was incubated at 55°C for 2 h. The 
suspension became clear and the clear lysate was extracted with equal 

25 volume of phenol:chloroform:isoamy! alcohol (25:24:1 ). After centrifuging 
at 12,000 rpm for 20 min, the aqueous phase was carefully removed and 
transferred to a new tube. Two volumes of ethanol were added and the 
DNA was gently spooled with a sealed glass pasteur pipet. The DNA was 
dipped into a tube containing 70% ethanol. After air drying, the DNA was 

30 resuspended in 400 pi of TE (1 0 mMTris-1 mM EDTA, pH 8) with RNaseA 
(100 ug/ml) and stored at 4°C. The concentration and purity of DNA was 
determined spectrophotometrically by OD 2 6o/OD 2 8o- A diluted aliquot of 
DNA was run on a 0.5% agarose gel to determine the intact nature of 
DNA. 

35 Chromosomal DNA was partially digested with Sau3AI 

(GIBRO/BRL, Gaithersburg, MD) as outlined by the instruction manual for 
the SuperCos 1 Cosmid Vector Kit. DNA (10 pg) was digested with 0.5 
unit of Sau3AI at room temperature in 1 00 pi of reaction volume. Aliquots 
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of 20 pi were withdrawn at various time points of the digestion: e.g., 0, 3, 
6, 9, 12 min. DNA loading buffer was added and samples were analyzed 
on a 0.5% agarose gel to determine the extent of digestion. A decrease in 
size of chromosomal DNA corresponded to an increase in the length of 

5 time for Sau3AI digestion. The preparative reaction was performed using 
50 ug of DNA digested with 1 unit of Sau3AI for 3 min at room 
temperature. The digestion was terminated by addition of 8 mM of EDTA. 
The DNA was extracted once with phenol:chloroform:isoamyl alcohol and 
once with chloroform. The aqueous phase was adjusted to 0.3 M NaOAc 

10 and ethanol precipitated. The partially digested DNA was 

dephosphorylated with calf intestinal alkaline phosphatase and ligated to 
SuperCos 1 vector, which had been treated according to the instructions 
in the SuperCos 1 Cosmid Vector Kit. The ligated DNA was packaged 
into lamda phage using Gigapack III XL packaging extract, as 

1 5 recommended by Stratagene (manufacturer's instructions were followed). 
The packaged Acinetobacter genomic DNA library contained a phage titer 
of 5.6 x 10 4 colony forming units per ug of DNA as determined by 
transfecting E. co//XL1-Blue MR. Cosmid DNA was isolated from six 
randomly chosen E. coli transformants and found to contain large inserts 

20 ofDNA(25-40kb). 

Identification and Characterization of Cosmid Clones Containing a 
Cvclohexanone Monooxygenase Gene 

The cosmid library of Acinetobacter sp. SE19 was screened based 
on the homology of the cyclohexanone monooxygenase gene. Two 

25 primers, monoL: GAGTCTGAGCATATGTCACAAAAAATGGATTTTG 
(SEQ ID NO:66) and monoR: 

GAGTCTGAGGGATCCTTAGGCATTGGCAGGTTGCTTGAT (SEQ ID 
NO:67) were designed based on the published sequence of 
cyclohexanone monooxygenase gene of Acinetobacter sp. NCIB 9871. 

30 The cosmid library was screened by PCR using monoL and monoR 
primers. Five positive clones (5B12, 5F5, 8F6, 14B3 and 14D7) were 
identified among about 1000 clones screened. They all contain inserts of 
35-40 kb that show homology to the cyclohexanone monooxygenase gene 
amplified by monoL and monoR primers. Southern hybridization using 

35 this gene fragment as a probe indicated that the cosmid clone 5B1 2 has 
about 20kb region upstream of the monooxygenase gene and cosmid 
clone 8F6 has about 30kb downstream of the monooxygenase gene. 
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Cosmid clone 14B3 contains rearranged Acinetobacter DNA adjacent to 

the monooxygenase gene. 

Construction of shot-gun sequencing libraries 

Shot gun libraries of 5B12 and 8F6 were constructed. Cosmid DNA 

5 was sheared in a nebulizer (Inhalation Plastics Inc., Chicago, IL) at 20 psi 
for 45 sec and the 1-3 kb portion was gel purified. Purified DNA was 
treated with T4 DNA polymerase and T4 polynucleotide kinase following 
manufacturer's (GIBCO/BRL) instructions. Polished inserts were ligated 
into pUC18 vectors using Ready-To-Go pUC18Smal/BAP+Ligase 

10 (GIBCO/BRL). The ligated DNA was transformed into E. coli DH5a cells 
and plated on LB with ampicillin and X-gal. A majority of the 
transformants were white and those containing inserts were sequenced 
with the universal and reverse primers of pUC18 by standard sequencing 
methods. 

15 Shot gun library inserts were sequenced with pUC18 universal and 

reverse primers. Sequences of 200-300 clones from each library were 
assembled using Sequencher 3.0 program. A contig of 17419 bp 
containing the cyclohexanone monooxygenase gene was formed. 

EXAMPLE 5 

20 Isolation and Sequencing of Rhodococcu s erythropolis AN 12 

This Example describes isolation of Rhodococcus erythropolis 
AN 12 strain from wastestream sludge. A shotgun sequencing strategy 
approach permitted sequencing of the entire microbial genome. 
Isolation of Rhodococcus ervthmpolis AN 12 

25 Strain AN 1 2 of Rhodococcus erythropolis was isolated on the basis 

of ability to grow on aniline as the sole source of carbon and energy. 
Bacteria that grow on aniline were isolated from an enrichment culture. 
The enrichment culture was established by inoculating 1 ml of activated 
sludge into 1 0 ml of S1 2 medium (10 mM ammonium sulfate, 50 mM 

30 potassium phosphate buffer (pH 7.0), 2 mM MgCI 2 , 0.7 mM CaCI 2 , 50 \M 
MnCI 2 , 1 uM FeCI 3 , 1 uM ZnCI 3 , 1.72 uM CuS0 4> 2.53 uM CoCI 2 , 2.42 uM 
Na 2 Mo0 2 , and 0.0001% FeS0 4 ) in a 125 ml screw cap Erlenmeyer flask. 
The activated sludge was obtained from a DuPont wastewater treatment 
facility. The enrichment culture was supplemented with 100 ppm aniline 

35 added directly to the culture medium and was incubated at 25°C with 
reciprocal shaking. The enrichment culture was maintained by adding 
100 ppm of aniline every 2-3 days. The culture was diluted every 14 days 
by replacing 9.9 ml of the culture with the same volume of S12 medium. 
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Bacteria that utilize aniline as a sole source of carbon and energy were 
isolated by spreading samples of the enrichment culture onto S12 agar. 
Aniline was placed on the interior of each petri dish lid. The petri dishes 
were sealed with parafilm and incubated upside down at room 

5 temperature (25°C). Representative bacterial colonies were then tested 
for the ability to use aniline as a sole source of carbon and energy. 
Colonies were transferred from the original S12 agar plates used for initial 
isolation to new S12 agar plates and supplied with aniline on the interior of 
each petri dish lid. The petri dishes were sealed with parafilm and 

10 incubated upside down at room temperature (25°C). 

A 16S rRNA gene of strain AN12 was sequenced (SEQ ID NO:6) 
as described in the General Methods and compared to other 16S rRNA 
sequences in the GenBank sequence database. The 16S rRNA gene 
sequence from strain AN 1 2 was at least 98% homologous to the 1 6S 

15 rRNA gene sequences of high G + C Gram positive bacteria belonging to 
the genus Rhodococcus. 

Preparation of Genomic DNA for Sequencing and Se quence Generation 
Genomic DNA and library construction were prepared according to 
published protocols (Fraser et a/. Science 270(5235): 397-403 (1995)). A 

20 cell pellet was resuspended in a solution containing 100 mM Na-EDTA 
(pH 8.0), 10 mM Tris-HCI (pH 8.0), 400 mM NaCI, and 50 mM MgCI 2 . 

Genomic DNA preparation After resuspension, the cells were 
gently lysed in 10% SDS, and incubated for 30 minutes at 55°C. After 
incubation at room temperature, proteinase K (Boehringer Mannheim, 

25 Indianapolis, IN) was added to 100 ng/ml and incubated at 37°C until the 
suspension was clear. DNA was extracted twice with Tris-equilibrated 
phenol and twice with chloroform. DNA was precipitated in 70% ethanol 
and resuspended in a solution containing 10 mM Tris-HCI and 1 mM Na- 
EDTA (TE buffer) pH 7.5. The DNA solution was treated with a mix of 

30 RNAases, then extracted twice with Tris-equilibrated phenol and twice with 
chloroform. This was followed by precipitation in ethanol and 
resuspension in TE buffer. 

Library construction 200 to 500 ug of chromosomal DNA was 
resuspended in a solution of 300 mM sodium acetate, 10 mM Tris-HCI, 

35 1 mM Na-EDTA, and 30% glycerol, and sheared at 1 2 psi for 60 sec in an 
Aeromist Downdraft Nebulizer chamber (IBI Medical products, Chicago, 
IL). The DNA was precipitated, resuspended and treated with Bal31 
nuclease (New England Biolabs, Beverly, MA). After size fractionation, a 
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fraction (2.0 kb, or 5.0 kb) was excised, cleaned and a two-step ligation 
procedure was used to produce a high titer library with greater than 99% 
single inserts. 

Sequencing A shotgun sequencing strategy approach was adopted for 
5 the sequencing of the whole microbial genome (Fleischmann, R. er a/. 
Whole-Genome Random sequencing and assembly of Haemophilus 
influenzae Rd. Science 269(5223): 496-512 (1995)). 

EXAMPLE 6 

Identification and Characterization of Bacterial Genes 

10 Genes encoding each monooxygenase were identified by 

conducting BLAST (Basic Local Alignment Search Tool; Altschul, S. F., et 
a/., (1993) J. Mol. Biol. 215:403-410; see also 
www.ncbi.nlm.nih.gov/BLAST/) searches for similarity to sequences 
contained in the BLAST "nr" database (comprising all non-redundant 

15 GenBank CDS translations, sequences derived from the 3-dimensional 
structure Brookhaven Protein Data Bank, the SWISS-PROT protein 
sequence database, EMBL, and DDBJ databases). The sequences 
obtained in Examples 1, 2, 3, 4, and 5 were analyzed for similarity to all 
publicly available DNA sequences contained in the "nr" database using the 

20 BLASTN algorithm provided by the National Center for Biotechnology 
Information (NCBI). The DNA sequences were translated in all reading 
frames and compared for similarity to all publicly available protein 
sequences contained in the "nr" database using the BLASTX BLOSUM62 
algorithm with a gap exisitense cost of 1 1 per residue gap cost of 2, 

25 filtered, gap alignment (Gish, W. and States, D. J. Nature Genetics 
3:266-272 (1993)) provided by the NCBI. 

All comparisons were done using either the BLASTNnr or 
BLASTXnr algorithm. The results of the BLAST comparisons are given in 
Table 3 which summarize the sequence to which each sequence has the 

30 most similarity. Table 3 displays data based on the BLASTXnr algorithm 
with values reported in expect values. The Expect value estimates the 
statistical significance of the match, specifying the number of matches, 
with a given score, that are expected in a search of a database of this size 
absolutely by chance. 
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monooxygenase, flavin-binding family 
[Mycobacterium tuberculosis 
CDC15511 


>pir||A83453 probable flavin-containing 
monooxygenase PA1 538 [imported] - 
Pseudomonas aeruginosa (strain PA01) 
gb|AAG04927.1 |AE004582_5 
(AE004582) probable flavin-containing 
monooxygenase [Pseudomonas 
aeruainosal 


>gb|AAG10021.1|AF282240_5 
(AF282240) cyclohexanone 
monooxygenase [Acinetobacter sp. 
SE19] 


>pir||JC7158 steroid monooxygenase 
(EC 1.14.99.-) - Rhodococcus 
rhodochrous 

dbjlBAA24454.1| (AB010439) steroid 
monooxygenase [Rhodococcus 
rhodochrous] 


Gene 
Name and 
Organism of 


ORF 17 chnB 

Rhodococcus 

erythropolis 

AN12 


ORF 18 chnB 

Rhodococcus 

erythropolis 

AN12 


ORF 19 chnB I 

Rhodococcus 

erythropolis 

AN12 


ORF 20 chnB 

Rhodococcus 

erythropolis 

AN12 


ORF 
Name 




ao 




o 
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EXAMPLE 7 

Cloning and Expression Of Monooxvaenase Genes into Escherichia coli 

This example illustrates the expression in E. coli of isolated full 
length BVMO genes from Brevibacterium sp. HCU, AcinetobacterSEAS, 
5 Rhodococcus sp. phil , Rhodococcus sp. phi2, Arthrobacter sp. BP2 and 
Acidovorax sp. CHX. 

Full length BVMO's were PCR amplified, using chromosomal DNA 
as the template and the primers shown below in Table 4. 

10 Table 4 

Primers Used for Amplification of Full-Length BV Monooxvaenases 



Monooxygenase 


Forward Primer 


Reverse Primer 


Brevibacterium sp. 
HCU chnB1 


atgccaattacacaacaacttgacc 
(SEQ ID NO:68) 


ctatttcatacccgccgattcac 
(SEQ ID NO:69) 


Brevibacterium sp. 
HCU chnB2 


atgacgtcaaccatgcctgcac 
(SEQ ID NO:70) 


cacttaagtcgcattcagccc 
(SEQ ID NO:71) 


Acinetobacter sp. 
SE19c/?nB 


atggattttgatgctatcgtg 
(SEQ ID NO:72) 


ggcattggcaggttgcttg 
(SEQ ID NO:73) 


Arthrobacter sp. 
BP2 chnB 


(SEQ ID NO:74) 


tcaaagccgcggtatccg 
(SEQ ID NO:75) 


Rhodococcus sp. 
phil chnB 


atgactgcacagatctcacccac 
(SEQ ID NO:76) 


(SEQ ID NO:77) 


Rhodococcus sp. 
phi2 chnB 






(SEQ ID NO:78) 


(SEQ ID NO:79) 


Acidovorax sp. CHX 
chnB 


(SEQ ID NO:80) 


cagtggttggaacgcaaagcc 
(SEQ ID NO:81) 



Following amplification, the chnB gene fragments were cloned into 
15 pTrcHis-TOPO TA vectors with either an N-terminal tail or C-terminal tail, 

as provided by the vector sequence (N-terminal tail for Brevibacterium sp. 

HCU, Rhodococcus sp. phil, Rhodococcus sp. phi2, and Arthrobacter sp. 

BP2 monooxygenases; C-terminal tail for Acinetobacter sp. SE19 and 

Acidovorax sp. CHX monooxygenases). These vectors were transformed 
20 into E. coli, with transformants grown in Luria-Bertani broth supplemented 

with ampicillin (100 ug/ml) and riboflavin (0.1 ug/ml) at 30°C until the 

absorbance at 600 nm (A600) reached 0.5. When the A600 was reached, 

the temperature was shifted to 16°C. 
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The encoded monooxygenase sequences were expressed upon 
addition of IPTG to the culture media, 30 min after the temperature shift to 
1 6°C. The cultures were grown further overnight (1 4 hrs) and harvested 
by centrifugation in a cold centrifuge. The cells were treated with 

5 lysozyme (100 mg/ml) for 30 min on ice and sonicated. Following 
sonication, cell extracts were centrifuged and the supernatant was 
equilibrated with Ni-NTA resin (Qiagen, Valencia, CA) for 1 hr at 4°C. 
Protein bound resin was washed successively with increasing 
concentrations of imidazole buffer until the protein of interest was 

10 released from the resin. The purified protein was concentrated and the 
buffer exchanged to remove the imidazole. The protein concentration was 
adjusted to 1 ug/ml. 

EXAMPLE 8 

Assays of chnB Monooxygenase Activities of Brevibacterium sp. HCU. 

15 Acinetobacter SE19. Rhodococcus sd. Phil. Rhodococcus sp. phi2. 
Arthrobacter sp. BP2 and Acidovorax sp. CHX. 
The chnB monooxygenase activity of each over-expressed enzyme . 
from Example 7 was assayed against various ketone substrates: 
cyclobutanone, cyclopentanone, 2-methylcyclopentanone, 

20 cyclohexanone, 2-methylcyclohexanone, cyclohex-2-ene-1-one, 1 ,2- 
cyclohexanedione, 1 ,3-cyclohexanedione, 1 ,4-cyclohexanedione, 
cycloheptanone, cyclooctanone, cyclodecanone, cycloundodecanone, 
cyclododecanone, cyclotridecanone, cyclopentadecanone, 2-tridecanone, 
2-phenylcyclohexanone, diheyl ketone, norcamphor, beta-ionone, 

25 oxindole, levoglucosenone, dimethyl sulfoxide, dimethyl-2-piperidone, and 
phenylboronic acid. Compounds were selected on the basis of previous 
observations by van der Werf {J. Biochem. 347:693-701 (2000)) and 
Miyamoto et al. (Biochimica et Biophysica Acta 1 251 : 1 1 5-1 24 (1 995)) 
and by searches for the ketone substructure. 

30 All compounds were obtained from Sigma-Aldrich with only two 

exceptions. Levoglucosenone was obtained from Toronto Reseach 
Chemicals, Inc. and dimethyl-2-piperidone was prepared according to 
U.S. Patent 6,077,955. For enzyme assays all compounds were 
dissolved to a concentration of 0.1 M in methanol, with the exceptions of 

35 norcamphor (dissolved in ethyl acetate), cyclododecanone, 

cycltridecanone and cyclopentadecanone (dissolved in propanol), and 
levoglucosenone (dissolved with acetone). 
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The monooxygenase activity of each over-expressed enzyme was 
assayed spectrophotometrically at 340 nm by monitoring the oxidation of 
NADPH. Assays were performed in individual quartz cuvettes, with a 
pathlength of 1 cm. The following components were added to the cuvette 

5 for the enzyme assays: 380 ul of 33.3 mM MES-HEPES-sodium acetate 
buffer (pH 7.5), 5 pi of 0.1 M substrate (1.25 mM final concentration), 
10 ul of 1 ug/pl enzyme solution (10 ng total, 0.025 ng/pl) and 5 ul NADPH 
(1.2 M, 15 mM final concentration ). An Ultrospec 4000 (Pharmacia 
Biotech, Cambridge, England) was used to read the absorbance of the 

10 samples over a two to ten minute time period and the SWIFT (Pharmacia 
Biotech) program was used to calculate the slope of the reduction in 
absorbance over time. For the Brevibacterium sp. HCU chnB2, the rates 
were multiplied by a factor of 3.25 to adjust for decrease in activity due to 
storage as suggested by the literature (J. Bacteriol. 2000. 182: 

15 p.4241-4248). Monooxygenase activity of each over-expressed enzyme is 
shown in Table 5, with respect to each ketone substrate. The specific 
activity values listed are given in umol/min/mg. The notation "ND" refers 
to "No Activity Detected". 

Graphical representation of the data shown in Table 5 is also 

20 provided in Figures 1 , 2, 3, 4, and 5. 



Table 5 

Specific Activity of Monooxygenase Enzymes Against Various 
Ketone Substrates 





Species 


Compound 


sp. 

HCU 

chnB1 


sp. 

HCU 

chnB2 


sp. 

SE19 

chnB 


sp. 

BP2 

chnB 


sp. 

CHX 

chnB 


sp. 
phil 
chnB 


sp. 

phi2 

chnB 


Norcamphor 


0.410 


1.331 


4.474 


2.842 


0.166 


1.504 


2.816 


Cyclobutanone 


ND 


0.374 


0.109 


0.128 


ND 


0.102 


0.154 


Cyclopentanone 


ND 


1.331 


3.034 


1.491 


0.621 


1.370 


2.451 


2-methyl- 
cyclopentanone 


1.395 


0.874 


8.378 


3.514 


0.627 


3.392 


6.445 


Cyclohexanone 


2.765 


1.726 


6.349 


3.565 


0.397 


3.680 


3.750 
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Species 


Compound 


sp. 

HCU 

chnB1 


sp. 

HCU 

chnB2 


sp. 

SE19 

chnB 


sp. 

BP2 

chnB 


sp. 

CHX 

chnB 


sp. 
phM 
chnB 


sp. 

phi2 

chnB 


2-methyl- 
cyclohexanone 


2.714 


1.622 


9.990 


4.205 


0.627 


4.774 


5.952 


Cyclohex-2-ene-1- 
one 


0.435 


0.541 


5.357 


2.739 


0.666 


2.694 


3.091 


1,2- 

cyclohexanedione 


0.787 


0.416 


0.077 


0.237 


0.096 


0.083 


ND 


1,3- 

cyclohexanedione 


0.237 


0.978 


0.237 


0.397 


0.032 


ND 


0.141 


1,4- 

cyclohexanedione 


3.405 


1.123 


8.346 


3.994 


0.794 


3.302 


6.150 


Cycloheptanone 


0.646 


0.374 


8.422 


3.846 


0.608 


3.622 


6.234 


Cyclooctanone 


ND 


ND 


1.984 


0.646 


0.410 


0.627 


0.141 


Cyclodecanone 


ND 


ND 


0.320 


0.166 


0.160 


0.077 


0.205 


Cycloundecanone 


ND 


0.125 


0.064 


0.064 


0.058 


ND 


0.051 


Cyclododecanone 


ND 


0.229 


0.122 


0.198 


0.051 


ND 


0.122 


Cyclotridecanone 


ND 


ND 


0.166 


0.147 


ND 


ND 


0.109 


Cyclopenta- 
decanone 


ND 


ND 


0.109 


0.122 


ND 


0.122 


ND 


2-tridecanone 


ND 


0.187 


ND 


ND 


0.096 


0.160 


1.690 


dihexyl ketone 


ND 


0.270 


ND 


ND 


ND 


0.160 


ND 


2-phenyl- 
cyclohexanone 


1.459 


0.104 


5.370 


ND 


0.192 


1.050 


0.730 


Oxindole 


2.438 


0.229 


7.091 


4.845 


0.307 


3.411 


4.858 


Levoglucosenone 


ND 


ND 


1.126 


0.525 


0.147 


0.461 


0.506 
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Species 


Compound 


sp. 

HCU 

chnB1 


sp. 

HCU 

chnB2 


sp. 

SE19 

chnB 


sp. 

BP2 

chnB 


sp. 

CHX 

chnB 


sp. 
phil 
chnB 


sp. 

phi2 

chnB 


dimethyl sulfoxide 


0.230 


ND 


0.819 


0.422 


0.358 


0.518 


0.544 


dimethy-2- 
piperidone 


2.822 


0.354 


8.384 


4.154 


0.557 


3.539 


6.509 


Phenylboronic acid 


1.606 


ND 


0.102 


0.192 


ND 


ND 


0.109 


beta-ionone 


0.109 


0.374 


3.347 


1.485 


0.544 


2.707 


0.544 



EXAMPLE 9 

Cloning Of Rhodococcus ervthropolis AN 12 Monooxvaenase 
Genes into Escherichia coll 
5 This example illustrates the construction of a suite of recombinant 

E. coli, each containing a full length BVMOs from Rhodococcus 
erythropolis AN 12. 

Full length BV monooxygenases were PCR amplified, using 
chromosomal DNA as the template and the primers shown below in Table 
10 6. 

Table 6 

Primers Used for Amplification of Full-Len ath BV Rhodococcus 
ervthrooolis AN 12 Monooxvaenases 



chnB 

Mono- 

oxygenase 


Forward Primer 


Reverse Primer 


ORF8 


atg age aca gag ggc aag tac gc 
(SEQ ID NO:82) 


[tea] gtc ctt gtt cac gta gta ggc c 
(SEQ ID NO:83) 


ORF9 


atg gtc gac ate gac cca acc to 
(SEQ ID NO:84) 


tta teg get cct cac ggt tte teg 
(SEQ ID NO:85) 


ORF10 


atg acc gat cct gac tte tec acc 
(SEQ ID NO:86) 


tea tgc gtg cac cgc act gtt cag 
(SEQ ID NO:87) 


ORF11 


atg age ccc tec ccc ttg ccg ag 
(SEQ ID NO:88) 


tea tgc gcg ate cgc ctt etc gag 
(SEQ ID NO:89) 
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chnB 

Mono- 

oxyqenase 


Forward Primer 


Reverse Primer 


ORF12 


gtg aac aac gaa tot gac cac ttc 
(SEQ ID NO:90) 


tea tgc ggt gta etc egg ttc eg 
(SEQ ID NO:91) 


ORF 13 


atg age acc gaa cac etc gat g 
(SEQ ID NO:92) 


tea act ctt get egg tac egg eg 
(SEQ ID NO:93) 


ORF14 


atg aca gac gaa ttc gac gta gtg at 
(SEQ ID NO:94) 


tea get ctg gtt cac agg gac gg 
(SEQ ID NO:95) 


ORF 15 


atg gcg gag ata gtc aat ggt cc 
(SEQ ID NO:96) 


tea ccc teg cgc ggt egg agt c 
(SEQ ID NO:97) 


ORF 16 


gtg aag ctt ccc gaa cat gtc gaa ac 
(SEQ ID NO:98) 


tea tgc ctg gac get ttc gat ctt g 
(SEQ ID NO:99) 


ORF 17 


atg aca cag cat gtc gac gta ctg a 
(SEQ ID NO:100) 


eta tgc get ggc gac ctt get ate 
(SEQ ID NO:101) 


ORF 18 


atg tea tea egg gtc aac gac ggc c 
(SEQ ID NO:102) 


tea tec ttt gec tgt cgt cag tgc 
(SEQIDNO:103) 


ORF 19 


atg act aca caa aag gec ctg acc 
(SEQ ID NO:104) 


tea ggc gtc gac ggt gtc ggc c 
(SEQIDNO:105) 


ORF 20 


atg aca act acc gaa tec aga act c 
(SEQ ID NO:106) 


tea gcg cag att gaa gec ctt gta tc 
(SEQIDNO:107) 



Following amplification, the gene fragments were cloned into 
pTrcHis-TOPO TA vectors with either an N-terminal tail or C-terminal tail, 
as provided by the vector sequence. These vectors were transformed into 
5 E. coli, with transformants grown in Luria-Bertani broth supplemented with 
ampicillin (100 ug/ml). 

EXAMPLE 10 

Assays of chnB Monooxvoenase Activities of Rhodo coccus ervthropolis 
AN12 

10 The chnB monooxygenase activity of each expressed enzyme from 

Example 9 was tested for activity according to its ability to convert 
cyclohexanone to caprolactone. 
Conversion of Cvclohexanone to Caprolactone. 

Clones containing the full length monooxygenase genes were 

15 transferred from LB agar plate to 5 mL of M63 minimal media (GIBCO) 
containing 10 mM glycerol, 50 ug/mL ampicillin, 0.1 mM IPTG, and 
500 mg/L cyclohexanone. In addition to the clones containing full length 
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monooxygenases, a plasmid without an insert and a "no cell" control were 
also assayed. The encoded monooxygenase sequences were expressed 
upon addition of IPTG to the culture media. The cultures were incubated 
overnight at room temperature (24°C). Samples (1 .25 mL) for analysis 

5 were taken immediately after inoculation and after overnight incubation; 
cells were removed by centrifugation (4°C, 13,000 rpm). 
GC-MS Detection of Caprolactone 

Caprolactone formed by the action of the cloned monooxygenase 
was extracted from the aqueous phase with ethylacetate (1 .0 ml 

10 aqueous/0.5 mL ethylacetate). Caprolactone was detected by gas 

chromotagraphy mass spectrometry (GC-MS) analysis, using an Agilent 
6890 Gas chromatograph system. 

The analysis of the ethylacetate phase was performed by injecting 
1 uL of the ethyl acetate phase into the GC. The inlet temperature was 

15 1 1 5°C and the column temperature profile was 50° C for 4 min and 
ramped to 250°C at 20°C/min, for a total run time of 14 min. The 
compounds were separated with an Hewlet Packard HP-5MS (5% phenyl 
Methyl Siloxane) column (30 m length, 250 urn diameter, and 0.25 urn film 
thickness). The mass spectrometer was run in Electron Ionization mode. 

20 The background mass spectra was subtracted from the spectra at the 
retention time of caprolactone (9.857 min). Presence of caprolactone was 
confirmed by comparison of the test reactions to an authentic standard 
obtained from Aldrich Chemical Company (St. Louis, MO). 

Results of these assays are shown below in Table 7, in terms of the 

25 presence or absence of detectable caprolactone formation according to 
the activity of each expressed BV monooxygenase enzyme. 
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Table 7 

Ability of Monooxyqenase Enzymes to Convert Cvclohexanone to 
Caprolactone 



Formation of Caprolactone 



Detected 



Not Detected 



Not Assayed 



chnB 
Monooxygenases 



ORF8 
ORF9 
ORF11 
ORF12 
ORF16 
ORF 17 
ORF18 
ORF19 



ORF 15 
No cell control 
Plasmid control 



ORF 10 
ORF 13 
ORF 14 
ORF 20 



5 



EXAMPLE 11 

Identification of Signature Sequences Between Families of BV 

Monooxygenases 
Sequence analysis of the 20 genes encoding Baeyer-Villiger 
10 monooxygenases identified in the previous examples allows definition of three 
different BV signature sequence families based on amino acid similarities. Each 
family possesses several member genes for which biochemical validation of the 
enzyme as a functional BV enzyme capable of the oxidation of cyclohexanone 
was demonstrated (Examples, supra). Sequence alignment of the homologues 
15 for each family was performed by Clustal W alignment (Higgins and Sharp (1989) 
CABIOS. 5:1 51-153). This allows the identification of a set of amino acids that 
are conserved at specific positions in the alignment created from all the 
sequences available. 



The results of these Clustal W alignments are shown in Figures 7, 8, and 9 



20 for BV Familyl , BV family 2, and BV Family 3. In all cases, an "*" indicates a 
conserved signature amino acid position. The conserved amino acid signature 
sequence for each Family is shown in Figure 6, along with the signature 
sequence P-# positions. This conserved amino acid/ position set becomes a 
signature for each family. Any new protein with a sequence that can be aligned 

25 with those of the existing members of the family and which includes at the specific 
positions a at least 80% of the signature sequence amino acids can be 
considered a member of the specific family. 
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BV Family 1 

This family comprises the chnB monooxygenase sequences of 
Arthrobacter sp. BP2 (SEQ ID NO:12), Rhodococcus sp. phil (SEQ ID 
NO:8), Rhodococcus sp. phi2 (SEQ ID NO:1 0), Acidovorax sp. CHX (SEQ 

5 ID NO:14), Brevibacterium sp. HCU (SEQ ID NOs:1 6 and 1 8), and 
Rhodococcus erythropolis AN12 ORF10, ORF14, ORF19, and ORF20 
(SEQ ID NOs:26, 34, 44 and 46). Within a length of 540 amino acids, a 
total of 74 positions are conserved (100%).This signature sequence of 
Family 1 BV monooxygenases is shown beneath each alignment of 

10 proteins (Figure 7) and is listed as SEQ ID NO:47. The ability to identify 
the signature sequence within this family of proteins was made possible 
by: 1 ) the number of sequences of BV monooxygenases; and 2) the 
characterization of their activity as BV-monooxygenases. 

Based on the limited number (4 total) of BV monooxygenase 

15 sequences in the public domain, for which biochemical data is also 
available, 3 of these sequences align with the signature sequence 
discovered for Family 1 . These sequences are: 

(1 ) Acinetobacter sp. NCIMB9871 chnB (NCBI Accession 
Number AB026668, based on Chen, Y.C. et al. (J Bacterid. 

20 1 70(2):781 -789 (1 988)). Key biochemical characterization of this protein 
was performed by Donogue et al. (Eur J Biochem. 16;63(1):1 75-92 
(1976)), Trudgill et al, (Methods Enzymol. 188:70-77 (1990)), and Iwaki et 
al. (Appl Environ Microbiol. 65(1 1):5158-62 (1999)). This enzyme shares 
72 of the 74 conserved amino acids in the signature sequence of Family 1 

25 BV monooxygenases. 

(2) Rhodococcus erythropolis HmB (NCBI Accession Number 
AJ272366, based on the work of Barbirato et at. (FEBS Lett. 438 (3): 
293-296 (1998)) and van der Werf et al. {Biol. Chem. 274 (37): 
26296-26304 (1999)). Key biochemical characterization of this protein was 

30 performed by van der Werf, M.J. et al. {Microbiology 146 ( Pt 5):1 129-41 
(2000); Biochem J. 1;347 Pt 3:693-701 (2000); and Appl Environ 
Microbiol. 65(5):2092-102 (1 999)). This enzyme is known as a carvone 
monooxygenase 

(3) Rhodococcus rhodochrous smo (NCBI Accession Number 
35 AB010439). This enzyme was sequenced and characterized by Morii, S. 

et al. (J. Biochem. 126 (3), 624-631 (1999)). This enzyme is known as a 
steroid monooxygenase. It shares 74 of the 74 conserved amino acids in 
the signature sequence of Family 1 BV monooxygenases. 
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The enzymes described in the public domain having the highest 
sequence similarity to Group 1 have been characterized as dimethylaniline 
hydroxylases. 
BV Family 2 

5 This family comprises the chnB monooxygenase sequences of 

Rhodococcus ervthropolis AN1 2 ORF9, ORF12, ORF15, ORF 16, and 
ORF18 (SEQ ID NOs:24, 30, 36, 38, and 42). Within a length of 
497 amino acids, a total of 76 positions are conserved (100%). This 
signature sequence for Family 2 BV monooxygenases is shown beneath 

10 each alignment of proteins (Figure 8) and is listed as SEQ ID NO:48. The 
ability to identify the signature sequence within this family of proteins was 
made possible by: 1) the number of sequences of BV monooxygenases; 
and 2) the characterization of their activity as BV-monooxygenases. 
Based on the limited number (4 total) of BV monooxygenase 

15 sequences in the public domain, for which biochemical data is also 
available, only 1 of these sequences align with the signature sequence 
discovered for Family 2. This sequence is Pseudomonas putida JD1 Key 
biochemical characterization of this protein was performed by Tanner A., 
et al. (J Bacteriol. 182(23):6565-6569 (2000)). This enzyme is known as 

20 an acetophenone monooxygenase. It shares 69 of the 76 conserved 
amino acids in the signature sequence of Family 2 BV monooxygenases. 
, BV Family 3 

This family comprises the chnB monooxygenase sequences of 
Rhodococcus ervthropolis AN12 ORF8, ORF 1 1 , ORF 13, and ORF17 

25 (SEQ ID NOs:22, 28, 32, and 40). Within a length of 471 amino acids, a 
total of 41 positions are conserved (100%). This signature sequence for 
Family 3 BV monooxygenases is shown beneath each alignment of 
proteins (Figure 9) and is listed as SEQ ID NO:49. The ability to identify 
the signature sequence within this family of proteins was made possible 

30 by: 1 ) the number of sequences of BV monooxygenases; and 2) the 
characterization of their activity as BV-monooxygenases. 

There are no sequences in the public domain with demonstrated 
BV activity that belong to this group. The dimethylaniline N-oxidase 
shares only 30 amino acids out of 41 conserved amino acids discovered 

35 in the signature sequence, which represents less than 80% of the 
conserved positions. 
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CLAIMS 

What is claimed is: 

1 . An isolated nucleic acid fragment selected from the group 
consisting of: 

(a) an isolated nucleic acid fragment encoding a Baeyer- 
Villiger monooxygenase polypeptide having an amino acid 
sequence selected from the group consisting of SEQ ID 
NOs:8, 10, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 
and 46; 

(b) an isolated nucleic acid molecule encoding a Baeyer- 
Villiger monooxygenase polypeptide that hybridizes with 
(a) under the following hybridization conditions: 0.1X SSC, 
0.1% SDS, 65°C and washed with 2X SSC, 0.1% SDS 
followed by 0.1X SSC, 0.1% SDS; or 

an isolated nucleic acid fragment that is complementary to (a) or (b). 

2. An isolated nucleic acid molecule comprising a first nucleotide 
sequence encoding a polypeptide of at least 542 amino acids that has at 
least 55% identity based on the Smith-Waterman method of alignment 
when compared to a polypeptide having the sequence as set forth in SEQ 
ID NO:8 or a second nucleotide sequence comprising the complement of 
the first nucleotide sequence. 

3. An isolated nucleic acid molecule comprising a first nucleotide 
sequence encoding a polypeptide of at least 541 amino acids that has at 
least 53% identity based on the Smith-Waterman method of alignment 
when compared to a polypeptide having the sequence as set forth in SEQ 
ID NO:10 or a second nucleotide sequence comprising the complement of 
the first nucleotide sequence. 

4. An isolated nucleic acid molecule comprising a first nucleotide 
sequence encoding a polypeptide of at least 439 amino acids that has at 
least 37% identity based on the Smith-Waterman method of alignment 
when compared to a polypeptide having the sequence as set forth in SEQ 
ID NO:22 or a second nucleotide sequence comprising the complement of 
the first nucleotide sequence. 

5. An isolated nucleic acid molecule comprising a first nucleotide 
sequence encoding a polypeptide of at least 518 amino acids that has at 
least 44% identity based on the Smith-Waterman method of alignment 
when compared to a polypeptide having the sequence as set forth in SEQ 
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ID NO:24 or a second nucleotide sequence comprising the complement of 
the first nucleotide sequence. 

6. An isolated nucleic acid molecule comprising a first nucleotide 
sequence encoding a polypeptide of at least 541 amino acids that has at 
least 64% identity based on the Smith-Waterman method of alignment 
when compared to a polypeptide having the sequence as set forth in SEQ 
ID NO:26 or a second nucleotide sequence comprising the complement of 
the first nucleotide sequence. 

7. An isolated nucleic acid molecule comprising a first nucleotide 
sequence encoding a polypeptide of at least 462 amino acids that has at 
least 65% identity based on the Smith-Waterman method of alignment 
when compared to a polypeptide having the sequence as set forth in SEQ 
ID NO:28 or a second nucleotide sequence comprising the complement of 
the first nucleotide sequence. 

8. An isolated nucleic acid molecule comprising a first nucleotide 
sequence encoding a polypeptide of at least 523 amino acids that has at 
least 45% identity based on the Smith-Waterman method of alignment 
when compared to a polypeptide having the sequence as set forth in SEQ 
ID NO:30 or a second nucleotide sequence comprising the complement of 
the first nucleotide sequence. 

9. An isolated nucleic acid molecule comprising a first nucleotide 
sequence encoding a polypeptide of at least 493 amino acids that has at 
least 55% identity based on the Smith-Waterman method of alignment 
when compared to a polypeptide having the sequence as set forth in SEQ 
ID NO:32 or a second nucleotide sequence comprising the complement of 
the first nucleotide sequence. 

10. An isolated nucleic acid molecule comprising a first nucleotide 
sequence encoding a polypeptide of at least 539 amino acids that has at 
least 51% identity based on the Smith-Waterman method of alignment 
when compared to a polypeptide having the sequence as set forth in SEQ 
ID NO:34 or a second nucleotide sequence comprising the complement of 
the first nucleotide sequence. 

11. An isolated nucleic acid molecule comprising a first nucleotide 
sequence encoding a polypeptide of at least 649 amino acids that has at 
least 39% identity based on the Smith-Waterman method of alignment 
when compared to a polypeptide having the sequence as set forth in SEQ 
ID NO:36 or a second nucleotide sequence comprising the complement of 
the first nucleotide sequence. 
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1 2. An isolated nucleic acid molecule comprising a first nucleotide 
sequence encoding a polypeptide of at least 494 amino acids that has at 
least 43% identity based on the Smith-Waterman method of alignment 
when compared to a polypeptide having the sequence as set forth in SEQ 
ID NO:38 or a second nucleotide sequence comprising the complement of 
the first nucleotide sequence. 

1 3. An isolated nucleic acid molecule comprising a first nucleotide 
sequence encoding a polypeptide of at least 499 amino acids that has at 
least 53% identity based on the Smith-Waterman method of alignment 
when compared to a polypeptide having the sequence as set forth in SEQ 
ID NO:40 or a second nucleotide sequence comprising the complement of 
the first nucleotide sequence. 

14. An isolated nucleic acid molecule comprising a first nucleotide 
sequence encoding a polypeptide of at least 493 amino acids that has at 
least 44% identity based on the Smith-Waterman method of alignment 
when compared to a polypeptide having the sequence as set forth in SEQ 
ID NO:42 or a second nucleotide sequence comprising the complement of 
the first nucleotide sequence. 

1 5. An isolated nucleic acid molecule comprising a first nucleotide 
sequence encoding a polypeptide of at least 541 amino acids that has at 
least 54% identity based on the Smith-Waterman method of alignment 
when compared to a polypeptide having the sequence as set forth in SEQ 
ID NO:44 or a second nucleotide sequence comprising the complement of 
the first nucleotide sequence. 

1 6. An isolated nucleic acid molecule comprising a first nucleotide 
sequence encoding a polypeptide of at least 545 amino acids that has at 
least 42% identity based on the Smith-Waterman method of alignment 
when compared to a polypeptide having the sequence as set forth in SEQ 
ID NO:46 or a second nucleotide sequence comprising the complement of 
the first nucleotide sequence. 

17. The isolated nucleic acid fragment of Claim 1 selected from the 
group consisting of SEQ ID NOs:7, 9, 21, 23, 25, 27, 29, 31 , 33, 35, 37, 
39,41,43, and 45. 

1 8. An isolated nucleic acid fragment of Claim 1 isolated from 
Rhodococcus. 

19. A polypeptide encoded by the isolated nucleic acid fragment of 
Claim 1. 
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20. The polypeptide of Claim 1 9 selected from the group consisting 
of SEQ ID NOs:8, 10, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, and 
46. 

21 . An isolated nucleic acid fragment selected from the group 
consisting of: 

(a) an isolated nucleic acid fragment encoding a Baeyer- 
Villiger monooxygenase polypeptide having an amino 
acid sequence as set forth in SEQ ID NO:12; 

(b) an isolated nucleic acid molecule encoding a Baeyer- 
Villiger monooxygenase polypeptide that hybridizes with 
(a) under the following hybridization conditions: 0.1 X SSC, 
0.1 % SDS, 65°C and washed with 2X SSC, 0.1 % SDS 
followed by 0.1X SSC, 0.1% SDS; or 

an isolated nucleic acid fragment that is complementary to (a), or (b). 

22. An isolated nucleic acid molecule comprising a first nucleotide 
sequence encoding a polypeptide of at least 532 amino acids that has at 
least 57% identity based on the Smith-Waterman method of alignment 
when compared to a polypeptide having the sequence as set forth in SEQ 
ID NO:1 1 or a second nucleotide sequence comprising the complement of 
the first nucleotide sequence. 

23. An isolated nucleic acid fragment of Claim 21 isolated from 
Arthrobacter. 

24. A polypeptide encoded by the isolated nucleic acid fragment of 
Claim 21. 

25. The polypeptide of Claim 24 as set forth in SEQ ID NO: 12. 

26. An isolated nucleic acid fragment selected from the group 
consisting of: 

(a) an isolated nucleic acid fragment encoding a Baeyer- 
Villiger monooxygenase polypeptide having an amino 
acid sequence as set forth in SEQ ID NO: 18; 

(b) an isolated nucleic acid molecule encoding a Baeyer- 
Villiger monooxygenase polypeptide that hybridizes with 
(a) under the following hybridization conditions: 0.1X SSC, 
0.1% SDS, 65°C and washed with 2X SSC, 0.1% SDS 
followed by 0.1X SSC, 0.1% SDS; or 

an isolated nucleic acid fragment that is complementary to (a), or (b). 

27. An isolated nucleic acid molecule comprising a first nucleotide 
sequence encoding a polypeptide of at least 538 amino acids that has at 
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least 57% identity based on the Smith-Waterman method of alignment 
when compared to a polypeptide having the sequence as set forth in SEQ 
ID NO:17 or a second nucleotide sequence comprising the complement of 
the first nucleotide sequence. 

28. An isolated nucleic acid fragment of Claim 26 isolated from 
Acidovorax. 

29. A polypeptide encoded by the isolated nucleic acid fragment of 
Claim 26. 

30. The polypeptide of Claim 29 selected from the group consisting 
ofSEQIDNO:18. 

31 . A chimeric gene comprising the isolated nucleic acid fragment 
of any one of Claims 1,19, 25, 30, or 35 operably linked to suitable 
regulatory sequences. 

32. A transformed host cell comprising a host cell and the chimeric 
gene of Claim 31. 

33. The transformed host cell of Claim 32 wherein the host cell is 
selected from the group consisting of bacteria, yeast, filamentous fungi, 
and green plants. 

34. The transformed host cell of Claim 33 wherein the host cell is 
selected from the group consisting of proteobacteria and actinomycetes. 

35. The transformed host cell of Claim 34 wherein the host cell is 
selected from the group consisting of Burkholderia, Alcaligenes, 
Pseudomonas, Sphingomonas, Pandoraea, Delttia and Comamonas. 

36. The transformed host cell of Claim 33 wherein the host cell is 
selected from the group consisting of Rhodococcus, Acinetobacter, 
Mycobacteria, Nocardia, Arthrobacter, Brevibacterium, Acidovorax, 
Bacillus, Streptomyces, Escherichia, Salmonella, Pseudomonas, 
Aspergillus, Saccharomyces, Pichia, Candida, Comyebacterium, and 
Hansenula. 

37. The transformed host cell of Claim 33 wherein the host cell is 
selected from the group consisting of soybean, rapeseed, sunflower, 
cotton, com, tobacco, alfalfa, wheat, barley, oats, sorghum, rice, 
Arabidopsis, cruciferous vegetables, melons, carrots, celery, parsley, 
tomatoes, potatoes, strawberries, peanuts, grapes, grass seed crops, 
sugar beets, sugar cane, beans, peas, rye, flax, hardwood trees, softwood 
trees, and forage grasses 

38. A method of obtaining a nucleic acid fragment encoding a 
Baeyer-Villiger monooxygenase polypeptide comprising: 
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(a) probing a genomic library with the nucleic acid fragment of 
any one of Claims 1 , 21 , or 26; 

(b) identifying a DNA clone that hybridizes with the nucleic 
acid fragment of any one of Claims 1 , 21 , or 26; 

(c) sequencing the genomic fragment that comprises the 
clone identified in step (b); 

wherein the sequenced genomic fragment encodes a Baeyer-Villiger 
monooxygenase polypeptide. 

39. A method of obtaining a nucleic acid fragment encoding a 
Baeyer-Villiger monooxygenase polypeptide comprising: 

(a) synthesizing at least one oligonucleotide primer 
corresponding to a portion of the isolated nucleic acid 
sequence of any one of Claims 1 , 21 , or 26; and 

(b) amplifying an insert present in a cloning vector using the 
oligonucleotide primer of step (a); 

wherein the amplified insert encodes a Baeyer-Villiger 
monooxygenase polypeptide. 

40. A method for the identification of a polypeptide having 
monooxygenase activity comprising: 

(a) obtaining the amino acid sequence of a polypeptide 
suspected of having monooxygenase activity; and 

(b) aligning the amino acid sequence of step (a) with the 
amino acid sequence of a Baeyer-Villiger monooxygenase 
consensus sequence selected from the group consisting 
of SEQ ID NO:47, SEQ ID NO:48 and SEQ ID NO:49; 

wherein where at least 80% of the amino acid residues at positions 
p1-p74 of SEQ ID NO:47, or at least 80% of the amino acid residues at 
p1-p76 of SEQ ID NO:48 or at least 80% of the amino acid residues of 
p1-p41 of SEQ ID NO:49 are completely conserved, the polypeptide of (a) 
is identified as having monooxygenase activity. 

41 . A method according to Claim 40 wherein least 1 00% of the 
amino acid residues at positions p1-p74 of SEQ ID NO:47, or at least 
100% of the amino acid residues at p1-p76 of SEQ ID NO:48 or at least 
100% of the amino acid residues of p1-p41 of SEQ ID NO:49 are 
completely conserved. 

42. A method for identifying a gene encoding a Baeyer-Villiger 
monooxygenase polypeptide comprising: 
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(a) probing a genomic library with a nucleic acid fragment 
encoding a polypeptide wherein where at least 80% of 
the amino acid residues at positions p1- p74 of SEQ ID 
NO:47, or at least 80% of the amino acid residues at p1- 
p76 of SEQ ID NO:48 or at least 80% of the amino acid 
residues of p1-p41 of SEQ ID NO:49 are completely 
conserved; 

(b) identifying a DNA clone that hybridizes with a nucleic acid 
fragment of step (a); 

(c) sequencing the genomic fragment that comprises the 
clone identified in step (b); 

wherein the sequenced genomic fragment encodes a Baeyer-Villiger 
monooxygenase polypeptide. 

43. A method according to Claim 42 wherein least 1 00% of the 
amino acid residues at positions p1- p74 of SEQ ID NO:47, or at least 
100% of the amino acid residues at p1-p76 of SEQ ID NO:48 or at least 
100% of the amino acid residues of p1-p41 of SEQ ID NO:49 are 
completely conserved. 

44. The product of either of Claims 40 or 42. 

45. A method for the biotransformation of a ketone substrate to the 
corresponding ester, comprising: contacting a transformed host cell under 
suitable growth conditions with an effective amount of ketone substrate 
whereby the corresponding ester is produced, said transformed host cell 
comprising a nucleic acid fragment encoding an isolated nucleic acid 
fragment of any of Claims 1,21, 26 or 44; under the control of suitable 
regulatory sequences. 

46. The method of Claim 45 wherein the ketone substrate is 
selected from the group consisting of cyclic ketones and ketoterpenes 
having the general formula: 



R C R 1 

wherein R and Ri are independently selected from substituted or 
unsubstituted phenyl, substituted or unsubstituted alkyl, or substituted or 
unsubstituted alkenyl or substituted or unsubstituted alkylidene. 
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47. The method of Claim 46 wherein the ketone substrate is 
selected from the group consisting of Norcamphor, Cyclobutanone, 
Cyclopentanone, 2-methyl-cyclopentanone, Cyclohexanone, 2-methyl- 
cyclohexanone, Cyclohex-2-ene-1-one, 1 ,2-cyclohexanedione, 1,3- 
cyclohexanedione, 1 ,4-cyclohexanedione, Cycloheptanone, 
Cyclooctanone, Cyclodecanone, Cycloundecanone, Cyclododecanone, 
Cyclotridecanone, Cyclopenta-decanone, 2-tridecanone, dihexyl ketone, 
2-phenyl-cyclohexanone, Oxindole, Levoglucosenone, dimethyl sulfoxide, 
dimethy-2-piperidone, Phenylboronic acid, and beta-ionone. 

48. A method for the in vitro transformation of a ketone 
substrate to the corresponding ester, comprising: contacting a ketone 
substrate under suitable reaction conditions with an effective amount of a 
Baeyer-Villiger monooxygenase enzyme, the enzyme having an amino 
acid seqeunce selected from the group consisting of SEQ ID NOs:8, 10, 
22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, and 46. 

49. A method according to Claim 49 wherein the ketone substrate 
is selected from the group consisting of cyclic ketones and ketoterpenes 
having the general formula: 



R C Ri 

wherein R and are independently selected from substituted or 
unsubstituted phenyl, substituted or unsubstituted alkyl, or substituted or 
unsubstituted alkenyl or substituted or unsubstituted alkylidene. 

50. A method according to Claim 48 wherein the ketone substrate 
is selected from the group consisting of Norcamphor, Cyclobutanone, 
Cyclopentanone, 2-methyl-cyclopentanone, Cyclohexanone, 2-methyl- 
cyclohexanone, Cyclohex-2-ene-1-one, 1 ,2-cyclohexanedione, 1,3- 
cyclohexanedione, 1 ,4-cyclohexanedione, Cycloheptanone, 
Cyclooctanone, Cyclodecanone, Cycloundecanone, Cyclododecanone, 
Cyclotridecanone, Cyclopenta-decanone, 2-tridecanone, dihexyl ketone, 
2-phenyl-cyclohexanone, Oxindole, Levoglucosenone, dimethyl sulfoxide, 
dimethy-2-piperidone, Phenylboronic acid, and beta-ionone. 
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51 . A mutated microbial gene encoding a protein having an altered 
biological activity produced by a method comprising the steps of 

(i) digesting a mixture of nucleotide sequences with 
restriction endonucleases wherein said mixture comprises: 

a) a native microbial gene selected from the group 
consisting of SEQ ID NOs:7, 9, 11, 13, 15, 17, 19, 21, 
23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, and 45; 

b) a first population of nucleotide fragments which will 
hybridize to said native microbial sequence; 

c) a second population of nucleotide fragments which 
will not hybridize to said native microbial sequence; 

wherein a mixture of restriction fragments are produced; 

(ii) denaturing said mixture of restriction fragments; 

(iii) incubating the denatured said mixture of restriction 
fragments of step (ii) with a polymerase; 

(iv) repeating steps (ii) and (iii) wherein a mutated microbial 
gene is produced encoding a protein having an altered 
biological activity. 

52. An Acidovorax sp. comprising the 1 6s rDNA sequence as set 
forth in SEQ ID NO:5 

53. An Arthrobacter sp. comprising the 1 6s rDNA sequence as set 
forth in SEQ ID NO:1 

54. A Rhodococcus sp. comprising the 1 6s rDNA sequence as set 
forth in SEQ ID NO:6 

55. An isolated nucleic acid useful for the identification of a BV 
monooxygenase selected from the group consisting of SEQ ID 70-1 13. 
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FIGURE 6 

BVMO Family 1 consensus: 

MTAQESLTVVDAWIGAGFGGIYAVHKLREQGLTWGFDAADGPGGTWYWNRYPGALSOT 

TYPTQPEIIJ^LEDVVDRFDLRMFRFGTEVTSATYLEDENLWE 

ETIHTAAWPEGVDLTGKRVGVIGTGSTGIQVITALAPEVEHLTVFVRTO 

SGVAFGFEESTWi^SVSEEERNRWEEAWEEGGGFRPMFGTPGDIATDEAANETAASFIRSKIREIVKDPETARKIiTPTG 
LFARRRLCDIX3YYEVYNRPNVEAVDIKENPIREITAKGVVTED 

VTOGQPTSYLGLSTAGFPNWFMVLGPNGPFTNLPPSIETQVEWISDTIAYAEENGIRAIE^ 
TKADSWIFGAWPGKKPSVLFYLGGLGNYRAVLADVAAAGYRGFALKSADAVTA (SEQ ID NOt47) 



SiRpatnre Sequence Positions 
BVMO Family 1 
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BVMO Family 2 consensus; 

MVXIPXRHXEWIIGAGFAGIGAAVELKRXGIDDFVLLERAD^^ 

NP^TRLFAXQPEIYDYLEDVAWCXGLXXHVRFGVEVTEARTOESAQLWRVXTASGELTAXFLVAA 

XPKIPDLPGLESFEGXXFHSAXWNHDLDLRGERVAWGTGASAVQFVPEIADXAXTLTVFQRTPQWVLPRP 

DXTLPXAXRAWSRVPGTQKWIJUatfiYGIFEAIXSSGFVXPX^ 

EGRSLADHWNGSAXAYLGTAVSGFPI^FXLLGPNTGLGHTSIVXILEAQAEYIASAIjXXMRREGIjGALD^ 
AEVQXXFNXAVQERLATTVWNAGGCSSWYXDPDGRNSTXWPWSTXXFRARTRRFDPSDYXPSSPTPETXXG 
(SEQ XD NOl48) 



Signature Sequence Positions 
BVMO Family 2 
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BVMO Family 3 consensus: 

MSTEHLDVLIIGAGLSGIGAAXRLXREXGIXFAII«EA^ 

HEI^YVRDTAXEXGLRXHIXFGTKWAAXXXAXSLWTOWXXXGETEVXTYNV^ 
HPQXYPFJCLDYRGKKVWIGSGASGXTIAPXMXXXAXHVTML^^ 
XAKRLXLLLIRRQLGKNVXLXGFPTPSYXPWDQHI^^ 
ATGLHXXXGGPFIXXDGLLVDLXXRXMiFYKXXXXSDNLNPLGXVGYTO 

AXAEXXXXLIASGYKXRXXGXMPXQGXKXXWXXXXNYXXDRXIJCXX (SEQ 



Signature Seqnence Positions 
BVMO Family 1 
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2005 MTDE FDWIVGAGLAGMQMIjHEVR-MVGLTAKV 

1273 MTDPDFSTAP LDVWIGAGVAGMYAMHRLR-EQGLRVHG 

Arthrobacter MTAQNTFQT VDAWIGAGFGGIYAVHKLHNEQGLTWG 

2082 MTTQKALTT VDAIVIGAGFGGIYAVHKLANELGLTTVG 

RhodoCOCCUS-phi2-MonO MTAQTIHT VDAWIGAGFGGIYAVHKLHHELGLTTVG 

RhodoCOCCUS -phi 1 -Mono MTAQISPTV VDAWIGAGFGGIYAVHKLHNEQGLTWG 

Acidovorax MSSSPSSAIH FDAIWGAGFGGMYMLHKLRDQLGLKVKV 

Brevibacterium-Monol MPITQQLD HDAIVIGAGFSGLAILHHLR-EIGLDTQI 

2093 MTTTESRTQTDKAGAVTLDALI IGAGVAGLYQLHMLR-EQGLNVRA 

Brevibacterium-Mono2 MTSTMPAPTAAQAN-ADETEVLDALIVGGGFSGPVSVDRLR-EDGFKVKV 



2005 FEAGGGAGGTWYWNRYPGARCDVESLEYSYQFSEVLQQEWEWTRRYADQA 

1273 FEAGSGVGGTWYFNRYPGARCDVESFDYSYSFSEELQQDWDWSEKYAAQP 

Arthrobacter FDKADGPGGTWYWNRYPGALSOTESHVYRFSFDKGLLQDGTWKHTYITQP 

2082 FDKADGPGGTWYWNRYPGALSDTESHVYRFSFDRDLLQDGTWKHTYTTQP 

RhodocoCCUS-phl2-Mono FDKAIX3PGGTWYWNRYPGALSDTESHLYRFSFDRDLLQDGTWKNTYVTQP 

Rhodococcus-phil-Mono F DKADG PGGTWYWNRY PGAL SDTESHLYRFSFDRDLLQDGTWKTTYI TQP 

Acidovorax FOTAGGIGGTWYWNRYPGALSDTHSHVYQYSFDEAMLQEWTWKNKYLTQP 

Brevibacterium-Monol VEATDGIGGTWWINRYPGVRTDSEFHYYSF SF SKEVRDEWTWTQRYPDGE 

2093 YDAAEDVGGTWYWNRYPGARFDSEAYIYQYLFSEDLYKNWSWSQRFPAQP 

Brevibacterium-Mono2 WDAAGGFGGIWWWNCYPGARTDSTGQIYQFQY-KDLWKDFDFKELYPDFN 



2005 EIMRYISHWETFDIiARDIRFHTOVFAMTYEETTARVm/QTDSAGEWAK 

1273 EILSYLDHVADRFDLRTGFTFiyraVLSAQFDEGTATWWQTDGGHnVTSR 

Arthrobacter EILEYLEDWDRFDLRRHFRFGTEVKSATYLEDEGLWEVTTGGGAVYRAK 

2082 EILEYIxEDWSRPDLRRHFHFGTAVESAWLFJ3EQLWEVTTDTGEIYRAT 

RhodoCOCCUS -phi2 -Mono EILEYIiEDWDRFDLRRHTRFGTEVTSAIYLDDENLWEVOTDGGDWRAT 

Rhodococcus-phil-Mono EII^LESWDRPDLRRHFPJGTEVTSAIYI^ENLWEVSTDKGEVYRAK 

Acidovorax EIIAYLEWADRLDI^PDIQIjNTTVTSMHFNEVHNIWEVRTDRGGYYTAR 

Brevibacterium-Monol EVCAYIJff IADRLDLRKDIQLNSRVNTARWNETEKYWDVIFEDGSSKRAR 

2093 EIERWMRYVADTLDLRRSIQFSTTITSAEFDEVAERWTIRTDRGEEISTR 

Brevibacterium-Mono2 GVREYFEYVDSQLDLSRDVTFNTFAESCTWDDAAKEWTVRSSEGREQRAR 



2005 FVIMATGCLSEPNVPYIPGVETPAGDVLHTGRWPQDPVDFTGKRVGVIGT 

1273 FWCATGSLSTANVPNIAGRETFGGDWHTGFWPHEGVDFTGKRVGVIGT 

Arthrobacter winavgllsainfpnlpgidtfegetihtaawp-qgksiiAGrrvgvigt 

2082 YWNAVGLLSAINRPDLPGLETFEGETIHTAAWP-EGKDLTGRRVGVIGT 

Rhodococcus-phi2 -Mono YWNAVGLL S A INF PNLPGLDTFEGETIHTAAWP- EGKSLAGRRVGVI GT 

Rhodococcus-phil-Mono YWNAVGLLSAINFPDLPGLDTFEGETIHTAAWP-EGKNIjAGKRVGvTGT 

Acidovorax FIVTALGLLSAHWPNIPGRESFQGEMYHTAAWP-KDVELRGKRVGVIGT 

Brevibacterium-Monol FLISAMGALSQAIFPAIDGIDEFNGAKYHTAAWPAIXSVDFTGKKVGVIGV 

2093 FFITCCGMLSAPMEDLFPGQQDFRGQIFHTSRWPHGDVELTGKRVGWGV 

Brevibac ter ium-Mono2 AVIVATGFGAKPLYPNIEGLDSFEGECHHTARWPQGGLDMTGKRWVMGT 



2005 GSSGVQAIPLIARQAAELWFQRTPAYTLPAVDEPLDPELQAAIKADYRG 

1273 GSSGIQSIPLIAEQADHLYVFQRSANYSVPAGNTPLDDKRRAEIKAGYAE 

Arthrobacter GSTGQQVITALAPEVEHLTVFVRTPQYSVPVGKRPVTTQQIDEIKADYDN 

2082 GSTGQQVITAIAPTVEHLTVFVRTPQYSVPVGKRAVTDEQIDAVKADYEN 

RhodoCOCCUS -phi 2 -Mono GSTGQQVITAIAPEVEHLTVFVRTPQYSVPVGNRPVTPEQIDAIKADYDR 

Rhodococcus-phil-Mono GSTGQQVITAIAPF^EHLTVFVRTPQYSVPVGNRPVTKBQIDAIKADYDG 

Acidovorax GSTGVQLITAIAPEVKHLTVFQRTPQYSVPTGNRPVSAQEIAEVKRNFSK 

Brevibacterium-Monol GASGIQII PELAKMGFiFWQRTPNYWESNNDKvDAEWMQYVRDNYDE 

2093 GATGIQVIQTIADEVDQUCVEVRTPQYALPMKNPQYDSDDVAAYKDRFEE 

Brevibacterium-Mono2 GASGIQVIQEAAAVAEHLTVFQRTPNLALPMRQQRLSADDNDRYRENIED 



2005 FRARNNEVPTAGLSRFPTNPNSVFLFSTKERDAILEHNWNRGG — PLMLR 

1273 RRALSK— RSGGGSPFVSDPRSALEVSEAERNAAYEERWKLGG— VLFAK 

Arthrobacter I WAQVK — RSGVAFGFEESTVPAMSVTEEERRQVYEKAWEYGGGFRFMFE 
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Rhodococcus-phi 2 -Mono 

Rhodococcus-phil-Mono 

Acidovorax 

Brevibacterium-Monol 

2093 

Brevibacterium-Mono2 



PCTAJS02/27549 
IWTQVK — RSSVAF62ESSWPAH^fiA^P<RVYEEA^Q6GGiFEE'MFG 
IWEQAK — NaAVAFSFEBSTLPAMSVSEEERRRIFJpaftWDHSGOFHFMFG 
IWDSVK — KSAVAFGFEESTLPAMSVSEEERNRIFQEAWDHGGGFRFMFG 
VWQQVR— ESAVAFGFEESTVPAMSVSEAERQRVFQEAWNQGNGFYYMFG 
IFERAS — KHPFGVDMEYPTDSAVEVSEEERKKVFESKWEEGG-FHFANE 
LRTTLP — HTFTGFEYDFEYVWADLAPE-QRREVLENIYEYGS-IiKLWLS 
RFQIRD — NSFAGFDFYFIPQNAADTPEDERTAIYEKMWDEGG-FPLWLG 



2005 
1273 

Arthrobacter 
2082 

Rhodococcus -phi 2 -Mono 

Rhodococcus-phil-Mono 

Acidovorax 

Brevibacterium-Monol 

2093 

Brevibac ter ium-Mono2 



AFGDLLVDSAANEVVAEFVRNKIRQIVTDPEVAAKLTP-T- -HVIGCKRI 
TFADQTSNIEANGTAAAFAERK1RSEVQDQAIADLLI PND- -HPIGTKRI 

TFSDIATDEEANETAASFIRNKTvETIKDPETARKLTP TGLFARRP 

TFGDIATDEEANETAASFIRSKITAMIEDPETARKLTP TGLFARRP 

TFGDIATDEAANEAAASFIRSKIAEIIEDPETARKLMP TGLFARRP 

TFGDIATDEAANEAAASFIRSKIAEIIEDPETARKLMP TGLYAKRP 

TFCDIATDPQANEAAATFIRNKIAEIVKDPETARKLTP TDVYARRP 

CFTDLGTSPEASELASEFIRSKIREWKDPATADLLCPKS — YSFNGKRV 
SFAEMFFDEQVSDEI SEPVREKMRARLIDPELCDLLI PTD — YGFGTHRV 
OTQGLLTDEAANHTFYNFWRSKVHDRVKDPKTAEMLAPATPPHPFGVKRP 



2005 
1273 

Arthrobacter 
2082 

Rhodococcus-phi2-Mono 
Rhodococcus-phi 1 -Mono 
Acidovorax 
Brevibacterium-Monol 
2093 

Brevibacterium-Mono2 



CLSIXSYYETVNRVNVRIiVDIKRHPIEEITPTTARTGE-DSHDLDMLVFAT 
VTDTNYYQSYNRDNVSLVDLKSAPIEAIDEAGIKTAD-AHYELDALVFAT 
IiCDDGYFQVramPNVEAVAIKENPIREVTAKGVVTEDGVLHELDVIVFAT 
LCDDGYFQVFNRPNVEAVAIKENPIREITAKGVVTEIX3VIiHKLDVLVIiAT 
LCnAGYHQVFNRPWEAVAIKENPIREWAKGvVTEDGVLHELDVLVFAT 
LCDNGYYEVYNRPNVEAVAIKEHPIREVTAKGVVTEDGVLHELDVLVFAT 
LCDSGYYRTYNRSNVSLVDvTCATPISAMTPRGIRTADGVEHELDMLIIiAT 
PTGHGYYETFmTNvTttJ^ARGTPITRISSKGrVHGD-TEYELnAIVFAT 
PLETNYLEOTHRPMvTAIGVKNNPIARIVPQGIELTDGTFHELDVliLAT 
SLEQNYFBVYNQDNVDLIDSNATPITRVIjPNGvETPD-GVVECDvXVLAT 



2005 
1273 

Arthrobacter 
2082 

Rhodococcus-phi2 -Mono 
Rhodococcus-phil-Mono 
Acidovorax 

Brevibac ter ium-Monol 
2093 

Brevibacterium-Mono2 



GYDAITGALSRIDIRGRAGLSLOEAWS-DGPRTYLGLGVSGFPNLFIMTG 
GFDAMTGALDRIEIRGRNGETLREINV^-AGPRTYIfilJGVHGFPNLFIVTG 
GFDAVDGNYRRMEISGRIX3vNINDHWD-GQPTSYLGVSTAKFPNWFMVIjG 
GFDAVDGI^RRMTISGRGGLNINDHWD-GQPTSYLGIATANFPNVJFMVLG 
GFTJAVDGNYRRIEIRGRDGLHINDHWD-GQPTSYIiGVSTANFPNWFMvXG 
GFDAVlWNYRRIEIRGRNGLHimHWD-GCPTSYLGVTTANFPNWFMvT^ 
GYDAVDGNYRRIDLRGRGGQTINEHVOT-DTPTSYVGVSTANFPNMFMILG 
GFDAMTGTLTNIDIVGRIXJVILRDKWAQDGIjRTOIGLTVNGFPNFLMSLG 
GFDAGTGALTRIDIRGRGGRSLKEDWG-RDIRTTMGLMVHGYPNMLTTAV 
GFDNNSGGINAIDI KA -GGQLLRDKWA - TGVDTYMGLSTHG F PNLMFL YG 



2005 
1273 

Arthrobacter 
2082 

Rhodococcus -phi2 -Mono 
Rhodococcus-phi 1-Mono 
Acidovorax 
Brevibacterium-Monol 
2093 

Brevibacterium-Mono2 



PGSPSV-IiTNVLVAIHQHATWIGECLKHMTDNDIRTMEATPEAEQNWGDH 
PGSPSV-LSNMILAAEQHTOWIAGAINHLDSAGIDTIEPSAEAVDNWIiDE 
PNGP---FTNLPPSIETQVEWISDTVAYAEENGIRAIEPTPEAEAEWTET 

PNGP FTNLPPSIETQVEWISDTIGYVERTGVRAIEPTPEAESAWTAT 

PNGP FTNLPPSIETQVEWISDTIGYAERNGVRAIEPTPEAEAEWTET 

PNGP FTNLPPSIETQVEWISDTVAYAERNEIRAIEPTPEAEEEWTQT 

PNGP- - - FTNLPPSIEAQVEWITDLVAHMRQHGLATAEPTRDAEDAWGRT 

PQTP YSNLWP I QLGAQWMQRF LKFI Q ERG I EVFESSREAEE IWNAE 

PLAPSAALCNMTTCLQQQTEWISEAIRYMQERDLTVIEPTKEAEDAWVAH 
PQSPSG-FCNGTDFGGAPGDMVADFLIWLKDNGISRFESTEEVEREWRAH 



2005 
1273 

Arthrobacter 
2082 

Rhodococcus-phi2 -Mono 
Rhodococcus-phi 1 -Mono 
Acidovorax 

Brevibacterium-Monol 
2093 

Brevibacterium-Mono2 



VRDLAEQTLLSS CGSWYLGANIPGKRQVFMPLVG-FPDYAKKCAEI 

CSRRASATLFPS ANSWYMGANIPGKPRIFHPFIGGFGVYSDICADV 

CTQIANMTVFTK VDSWIFGANVPGKKPSVLFYLGGLGNYRGVLDDV 

CTDIANMTVFTK VDSWIFGANVPGKKPSVLFYLGGLGNYRAVLADV 

CTAIANATLFTK GDSWIFGANIPGKTPSVLFYLGGLRNYRAVLAEV 

CTDIANATLFTR GDSWIFGANVPGKKPSVLFYLGGLGNYRNVXAGV 

CAEIAEQTLFGQ VESWIFGANSPGKKHTLMFYLAGLGNYRKQLADV 

TIRGAESTVMSIEGPKAGAWFIGGNIPGKSREYQVYMGGGQVYQDWCREA 

HDETAAVNLISK TDSWYVGSNVPGKPRRvLSYTGGVGAYREKAQEI 

VDD1FVNSLF PK AKSWYWGANVPGKPAQMLNYSEASPHI 



2 
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2005 
1273 

Arthrobacter 
2082 

Rhodococcus -phi2 -Mono 

Rhodocoocus-phil-Mono 

Acidovorax 

Brevibac ter ium-Monol 

2093 

Brevibac ter ium-Mono 2 



ASAGYPGFAFQYDP- -VPVNQS [SEQ' ID 

AAAGYRGFELN SAVHA .[SEQ ID 

TANGYRGFELKS-E— AAVAA [SEQ ID 

TEGGYQGFALKT-A — DTVDA [SEQ ID 

ATDGYRGFDVKS-A— EMVTV [SEQ ID 

VADSYRGFELKS -A- -VPVTAZ [SEQ ID 

ANAQYQGFAFQP-L [SEQ ID 

EESDYATFLNADSIDGEKVRESAGMK [SEQ ID 

ADAGYKGFNLR [SEQ ID 

[SEQ ID 



NO:26] 
NO:12] 
NO:44] 
NO: 10] 
NO: 8] 
NO:18) 
NO:14] 
NO:46] 
NO: 16] 
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1870 VNNES DHFEWTIGGGIS6IGAAIHLQRLG-IDNFALLEKADS 

2022 VKLP EHVETLIVGAGFA6MGLAARMLRDNRTADWLIERGAD 

1985 MVDIDPTSGPSAGDEETRTRRTRVWIGAGFGGIGTAVRLKQSG-IDDFVVLERAAE 

1294 MSSR VNDGHIAIIGTGPSGLCMAIELKKKG-IDDFVLYERADD 

2035 MAEIVNGPQ IKPATAKCDERLHA IVIGAGIAGMIASVELSRAG - - 1 PHVILEKNDD 

. ::* *:.*: : .: : . . : *: . 

1870 LGGTWRANTYPGCACDVPSGIjYSYSFAANPDWTRIiFAEQPE IREYIENTAGTHGVDKHVR 

2022 IGGTWRDNTYPGCACDVPTALYSYSFAPSADWSHTFARQPEIYDYLKKVAADTGIGDRVI 

1985 PGGTWQVNTYPGAQCDIPSILYSFSFAPNPNWTRLYPLQPEIYDYLRDCVHRFGLAGHFH 

1294 VGGTWRlWrYPGAACDVPSVLYSYSFAQNPNVJTRIFPPWSELLDYLRSVAAQYDLIiPHIR 

2 03 5 VGGSWWENRYPGAGVDTPSHLYSISSFP-RNWSTHFGKRDEVQGYIjEDFAEANDIRRNVR 



1870 FGVEMLSARWDASQSLWKITTS SGE-LTARFVIAAAGPWNEPLTPAIPGLEAFEGE 

2 022 LNCELEAAVWDEDAALWRVRTS LGS - LTVKALVAATGALSTPKI PDF PGLDQFSGT 

1985 CNQDVTEASWDEQAQ IWRVHTA EW- WEAQFLVAATGPFSAPATPDLPGLESFRGQ 

1294 FGVEVSEMRFDEDRLRWNIQFA SGESVTAAVWNGSGGLSNPYI PQLPGLESFEGA 

2035 FRHEVTRAEFEESKQSWRVSVQRPGEASETLEAPILISAVGLLNRPKIPHLPGIETFRGR 



1870 VFHSSQWNHDYD LTGKLVAWGTGASAVQFVPRIVSQVSALHLYQRTAQWVLPKPD 

2022 TFHSATWNHEHE LRGERVAVIGTGASAVQFVPEIADPAAHVTVFQRTPAWVIPRMD 

1985 MFHTADWNHDHD LRGERIAWGTGASAVQIIPRLQPLADTLTVFQRTPTWILPHPD 

1294 AFHSAKWRHDLD MSGRRVAVIGSGASAIQFVPEIAPHTETLHVFQRSPNWVMPRGD 

2035 LFHSAEWPSELDDPESLRGKRVGIVGTGASAMQIGPAIADRVGSLTIFQRSPQWIAPNDD 



1870 - -HYVPRIERSVMRFVPGAQKALRSIEYGIM— EALGLGFRNP- WILRIVQKLGSAQ- - - 

2 022 - -RTLPAAQKAVYSRI PATQKWRGAVYGFR— ELLGAAMSHATWVLPAFEAAARLH- - - 

1985 — QPMTGWPSALFERVPLTQRLARKGLDLLQ — EALVPGFVYKPSLIiKGIiAAIjGRAH- - - 

1294 — AALSPATRERFSRRPYRQRWIiRWRTYWAF — EKLASAFLGNRKLVEQYRSQALAN 

2035 YFTTIDDGVHWLMDNI PGYREWYRARLSWIFNDKVYSSLQVDPDWPEPSAS INATNHGHR 



1870 LRLQVRD- PKLRKALTPDYTLGCKRLLMSNSYYPALGKPNVSVHANAVEQIRGN 

2022 LSRQVKD- PELRRKliTPDFTIGCKRMLLSNDWLRTLDRADVSLVDSGLVSVTEG 

1985 LRRQVRD-PELRAKLLPHYAFGCKRPTFSNTYYPALASPOTEVVTDGIVEVQER 

1294 LQQQVPD-SDLRQKVTPDYDPGCKRRLISDDWYPALQRENVHLNTSGVSEIRPH 

2035 KFYERYLRDQIXJDRTDLIElASLPDYPPFGKRMLLiDNGWFTMLRKPDVTLVPHGVDALTPS 



1870 TVIGADGVEAEVDAIIFGTGFHILDMPIASKVFDGEGRSLDDHWQGSPQ-AYFGSAVSGF 

2022 GVVDGHGVEHKVDTIIFATGFTPTEPPVAHLITGKRGETLAAHWNGSPN-AYKGTAVSGF 

1985 GVLTADGAFREVDTIVMGTGFRMGDNPSFDTIRGQDGRSLAQTWNGSAE-AFLGTTISGF 

1294 SIIDSEGAEHEVDTLIFATGFQATSFLAPMKVFGREGVELSDSWREGAA-TKLGLASAAF 

2035 GLVDTNGVEHQLDVIVMATGFHSVRVLYPMDIVGRSGRSTGEIWGEHDARAYLGITVPDF 



1870 PNAFILLGPSLGTGHTSAFMIL-EAQIiNYVAQAIGHARRHGWQTIDVREEVQAAFNSQVQ 

2022 PNLFLMYGPNTNLGHSSIVYML-ESQAEYVOTAIjNTMKRERLDALDTOESVQVHYNKGIQ 

1985 PNFFMILGPNS-WYTSQWTI-EAQVEYIVSCILQMDERGIGSIDVRADVQREFVRATD 

1294 PNLWFLNGPNTGLGHNSI I FMI -EAQARYIASAVQYMRRKSITALELDRTVQTGSYAATQ 

2035 POTFVMTGPNTGIXjHGGSFITII^CQVRYIITOAIjKLMQSENLGAMECRAEVNDRYNEAVD 



1870 EAIX3TTVYNAGGCE3YFFDWGRNSFNWPWSSGAMRRRLRDFDPYAYNHTSNPESDNTPP 

2022 HELQHTVWHKGGCSSWYIDPEGRNSVQWPTFTFKFRSLIjEHFDRENYSAR-KIESVQA — 

1985 RRLATSVWNAGGCSSYYLVDGGRNYTFYPGFNRSFRARTKRADLAHYAQVQPVSSAALT- 

12 94 ERMRRTVWASGGCDSWYQSAIX3RIDTLWPASTIEYWLRTRLFRKSDFEIALTTGKG 

2035 RQHAQMVWTHPAMENWYRNPDGRWSVLPVfRINDYWAMTYRVDPSDFRTEPARSESVPTP 



4 
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2022 [SEQ ID NO:38] 

1985 — TARETVRSR [SEQ ID NO:24] 

1294 [SEQ ID NO:42] 

2035 — TARG [SEQ ID NO:36] 
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1861 -MSTEHLDVLIVGAGLSGIGAAYRLQTELPGKSYAILEARANSGGTWDLFKY — PGIRSD 

1976 — MTQHVDVLIIGAGLSGIGAACHLIREQTGSTYAILERRENIGGTWDLFKY — PGIRSD 

1413 - -MSTEGKYALIGAGPSGIAGARNLDR- -AGIAFDGFESHDDVGGLWDIDNP- -HSTVYE 

2034 MSPSPLPSVCIIGAGPTGITTAKRMKE--FGIPFDCYEASDEVGGNWYYKNPNGMSACYQ 



1861 SDMFTLGYP FRPWTDAKAIADGDS ILRYVRDTARENGIDKKIRYNRKVTAASWS 

1976 SDMLTFGFG- - - FRPWI GTKVIjADGAS — IRDYVEETAKEYGVTDHINFGRKWAMDFD 

1413 SAHLISSKGTTAFAEFPMADSVADYPSHIELAEYFRDYADTHDLRRHFAFG — TTVIDVL ' ' 

2034 SLHIOTSK6miAFEDFPVSADLPDFPHHSELFQYFKDYVEHFGLRESIIFN-TSWAAER 

1861 SATSTWTVTVTTGD — EDETLTCNFLYLCSGYYSYDGGYTPDFPGRESFAGEWHPQFWP 

1976 RTAAQWSVTVLVEATGETETWTANVLVGACGYYNYDKGYRPAFPGEDDFRGQIVHPQHWP 

1413 PVDSLWQVTTRSRS-GETSVARYRGVI IANGTLSKPN- -IPTFRG- -DFTGTLMHTSEYR 

2034 DANGLWTVTRSDGE VRTYDVLMVCNGHHWDPN— IPDYPG- -EFDGVLMHSHSYN 



1861 EELDYSDKKVWIGSGATAVTLVPTMSRDASHVTMLQRSPTYILALPSSDKLSDTIR 

1976 EDIiDYTGKKVWIGSGATAITLIPSMAPTAGHVTMLQRSPTOIQALPSEDPVAKGLK 

1413 — -SAEIFRGKRVLVIGAGNSG CDIAVDAVH QAECVDLSVRRGYY 

2 03 4 DPFDPIDMRGKKWWGMGNSG LDIASELGQR YLADKLIVSARRGVW 



1861 -AVLPNQLAHSIARWKSVVWLSFYQLCPJRSPARAKRMLNLAISRQLPKDI PLDP— HF 

1976 LARVPDQIAYKIGRARNIAliQRASFQIiSRTNPKLAKKLFLAQIRLQLGKNVDLR HF 

1413 --FVPKYLF— -GR-PSDTLN QGKPLPPWIKQRVDTLLLKQFTGDPVRFG FP 

2034 — VLPKYLG GV- PGDKL I T — PPWMPRGLRLFLSRRFLGKNLGTMEGYGL P 



1861 TPSYDPWDQRLCWPIXSDLFKAI^SGKASIETDHIDTFTETGrLIASGRELEADirvrrAT 

1976 TPSYNPWDQRLCVVPNGDLFKVLKSGKADIVTDRIATFTEKGIVTESGREIEADVTVTAT 

1413 APDYKIYES-HPW-NSLILHHIGHGDVHVRAD-VDRFEGKTVRFVDGSSADYDLVLCAT 

2034 KPDHRPFEA-HPSA-SGEFLGRAGSGDITFKPA-ITKXDGKQVHFADGTAEDVDWVCAT 



1861 G1^EACGGMSIEVIX3ELVTIX3DRYAYKGMMISE^POTAMCVGY™ASWTLRADLTSMYV 

1976 GLNVQILGGATMSIDGEFVKLNETVAYKSVLYSDI PNFLMILGYTNASWTLKADLAASYL 

1413 GYHLDYPFIAREDLDWSGAAPDLFLNVASRRH-DNLFVI/3MVEASGLGWQGR-YQQAELV 

2034 GYNISFPFFDDPNLLPDKDNRFPLFKRMMKPGIDNLFFMGLAQPMPTLVNFA-EQQSKLV 



1861 CRIjLTEMDKRDYSKCVPHAT-EEMDQRPILD--IiASGYVMRAVEQFPKQGSKSPWNMRQN 

1976 CRVUCIMRDRSYTTFEVHAEPEDFAEESLMGGALTSGYIQRGDGEMPRO^ARGAWKVVNN 

1413 AKLITARTEAPAAAREFSAA AAGPPPD— LSGGYK YLKLG RMA 

2034 AAYLTGKYQLPSANEMQEIT KADEAYF- -LAPYYK— S- PRHTIQLEFDPYVRNMN 



1861 Y1LDR-LHSTFGSINDHMTFSKAPARHSTPVPSKS- [SEQ ID NO.-32] 

1976 YYRDRKLMHDAE I EDGVLQ F SKVDI AWPDSKVASA [SEQ ID NO: 40] 

1413 YYVNK D [SEQ ID NO: 22] 

2034 KEIAKGTKRAAASGNKLPVAARAAAHEIiEKADRA— [SEQ ID NO:28] 
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SEQUENCE LISTING 

<110> E. I. DtJ PONT DE NEMOURS & COMPANY 

<120> GENES ENCODING BAEYER-VILLIGER MONOOXYGENASES 

<130> CL1789 PCT 

<150> 60/315,546 
<151> 2001-08-29 

<160> 113 



<170> Microsoft Office 97 






<210> 1 








<211> 791 








<212> DNA 








<213> Arthrobacter sp. BP2 






<400> 1 
accaccttcg 


acggctcccc cccacaaggg 


ttaggccacc ggcttcgggt gttaccaact 


60 


ttcgtgactt 


gacgggcggt gtgtacaagg 


cccgggaacg tattcaccgc agcgttgctg 


120 


atctgcgatt 


actagcgact ccgacttcat 


ggggtcgagt tgcagacccc aatccgaact 


180 


gagaccggct 


ttttgggatt agctccacct 


cacagtatcg caaccctttg taccggccat 


240 


tgtagcatgc 


gtgaagccca agacataagg 


ggcatgatga tttgacgtcg tccccacctt 


300 


cctccgagtt 


gaccccggca gtctcctatg 


agtccccggc cgaaccgctg gcaacataga 


360 


acgagggttg 


cgctcgttgc gggacttaac 


ccaacatctc acgacacgag ctgacgacaa 


420 


ccatgcacca 


cctgtaaacc ggccgcaagc 


ggggcacctg tttccaggtc tttccggtcc 


480 


atgtcaagcc 


ttggtaaggt tcttcgcgtt 


gcatcgaatt aatccgcatg ctccgccgct 


540 


tgtgcgggcc 


cccgtcaatt cctttgagtt 


ttagccttgc ggccgtactc cccaggcggg 


600 


gcacttaatg 


cgttagctac ggcgcggaaa 


acgtggaatg tcccccacac ctagtgccca 


660 


acgtttacgg 


catggactac cagggtatct 


aatcctgttc gctccccatg ctttcgctcc 


720 


tcagcgtcag 


ttacagccca gagacctgcc 


tttgccatcg gtgttcctct tgatatctgc 


780 


gcatttcacc 


9 




791 


<210> 2 
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<211> 1303 
<212> DNA 

<213> Rhodococcus sp. phil 
<400> 2 

gtgcttaaca catgcaagtc gaacgatgaa gcccagcttg ctgggtggat tagtggcgaa 60 

cgggtgagta acacgtgggt gatctgccct gcactctggg ataagcctgg gaaactgggt 120 

ctaataccgg atatgacctc gggatgcatg tcctggggtg gaaagttttt cggtgcagga 180 

tgagcccgcg gcctatcagc ttgttggtgg ggtaatggcc taccaaggcg acgacgggta 240 

gccggcctga gagggcgacc ggccacactg ggactgagac acggcccaga ctcctacggg 300 

aggcagcagt ggggaatatt gcacaatggg cgcaagcctg atgcagcgac gccgcgtgag 360 

ggatgacggc cttcgggttg taaacctctt tcacccatga cgaagcgcaa gtgacggtag 420 

tgggagaaga agcaccggcc aactacgtgc cagcagccgc ggtaatacgt aggtgcgagc 480 

gttgtccgga attactgggc gtaaagagct cgtaggcggt ttgtcgcgtc gtctgtgaaa 540 

tcccgcagct caactgcggg cttgcaggcg atacgggcag actcgagtac tgcaggggag 600 

actggaattc ctggtgtago ggtgaaatgc gcagatatca ggaggaacac cggtggcgaa 660 

ggcgggtctc tgggcagtaa ctgacgctga ggagcgaaag cgtgggtagc gaacaggatf 720 

agataocctg gtagtccacg ccgtaaacgg tgggcgctag gtgtgggttt ccttccacgg 780 

gatccgtgcc gtagccaacg cattaagcgc cccgcctggg gagtacggcc gcaaggctaa 840 

aactcaaagg aattgacggg ggcccgcaca agcggcggag catgtggatt aattcgatgc 900 

aacgcgaaga accttacctg ggtttgacat gtaccggacg actgcagaga tgtggtttcc 960 

cttgtggccg gtagacaggt ggtgcatggc tgtcgtcagc tcgtgtcgtg agatgttggg 1020 

ttaagtcccg caacgagcgc aacccttgtc ctgtgttgcc agcacgtgat ggtggggact 1080 

cgcaggagac tgccggggtc aactcggagg aaggtgggga cgacgtcaag tcatcatgcc 1140 

ccttatgtcc agggcttcac acatgctaca atggtcggta cagagggctg cgataccgtg 1200 

aggtggagcg aatcccttaa agccggtctc agttcggatc ggggtctgca actcgacccc 1260 

gtgaagtcgg agtcgctagt aatcgcagat cagcaacgct gcg 1303 

<210> 3 
<211> 1296 
<212> DNA 
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<213> Rhodococcus sp. phi2 
<400> 3 

gcttaacaca tgcaagtcga acgatgaagc ccagcttgct gggtggatta gtggcgaacg 60 

ggtgagtaac acgtgggtga tctgccctgc acttcgggat aagcctggga aactgggtct 120 

aataccggat aggacctcgg gatgcatgtt ccggggtgga aaggttttcc ggtgcaggat 180 

gggcccgcgg cctatcagct tgttggtggg gtaacggccc accaaggcga cgacgggtag 240 

ccggcctgag agggcgaccg gccacactgg gactgagaca cggcccagac tcctacggga 300 

ggcagcagtg gggaatattg cacaatgggc gcaagcctga tgcagcgacg ccgcgtgagg 360 

gatgacggcc ttcgggttgt aaacctcttt cagtaccgac gaagcgcaag tgacggtagg 42 0 

tacagaagaa gcaccggcca actacgtgcc agcaagccgc ggtaatacgt aaggtgcgaa 480 

gcgttgtccg gaattactgg gcgtaaagag ctcgtaggcg gtttgtcgcg tcgtctgtga 540 

aaacccgcag ctcaactgcg ggcttgcagg cgatacgggc agacttgagt actgcagggg 600 

agactggaat tcctggtgta gcggtgaaat gcgcagatat caggaggaac accggtggcg 660 

aaggcgggtc tctgggcagt aactgacgct gaggagcgaa agcgtgggta gcgaacagga 720 

ttagataccc tggtagtcca cgccgtaaac ggtgggegct aggtgtgggt ttccttccac 780 

gggatccgtg ccgtagctaa cgcattaagc gccccgcctg gggagtacgg ccgcaaggct 840 

aaaactcaaa ggaattgacg ggggcccgca caagcggcgg agcatgtgga ttaattcgat 900 

gcaacgcgaa gaaccttacc tgggtttgac atacaccgga ccgccccaga gatggggttt 960 

cccttgtggt cggtgtacag gtggtgcatg gctgtcgtca gctcgtgtcg tgagatgttg 1020 

ggttaagtcc cgcaacgagc gcaacccttg tcctgtgttg ccagcacgta atggtgggga 1080 

ctcgcaggag actgccgggg tcaactcgga ggaaggtggg gacgacgtca agtcatcatg 1140 

ccccttatgt ccagggcttc acacatgcta caatggccgg tacagagggc tgcgataccg 1200 

cgaggtggag cgaatccctt aaagccggtc tcagttcgga tcggggtctg caactcgacc 1260 

ccgtgaagtc ggagtcgcta gtaatcgcag atcagc 1296 

<210> 4 
<211> 1388 
<212> DNA 

<213> Brevibacterium sp. HCU 
<400> 4 
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cgcccttgag 


tttgatcctg 


gctcaggacg aacgctggct gcgtgcttaa cacatgcaag 


60 


tcgaacgctg 


aagccgacag 


cttgctgttg gtggatgagt ggcgaacggg tgagtaacac 


120 


gtgagtaacc 


tgcccctgat 


ttcgggataa gcctgggaaa ctgggtctaa taccggatac 


X80 


gaccacctga 


cgcatgttgg 


gtggtggaaa gtttttcgat cggggatggg ctcgcggcct 


240 


atcagcttgt 


tggtggggta 


atggcctacc aaggcgacga cgggtagccg gcctgagagg 


300 


gcgaccggcc 


acactgggac 


tgagacacgg cccagactcc tacgggaggc agcagtgggg 


360 


aatattgcac 


aatgggggaa 


accctgatgc agcgacgcag cgtgcgggat gacggccttc 


420 


gggttgtaaa 


ccgctttcag 


cagggaagaa gcgaaagtga cggtacctgc agaagaagta 


480 


ccggctaact 


acgtgccagc 


agccgcggta atacgtaggg tacgagcgtt gtccggaatt 


540 


attgggcgta 


aagagctcgt 


aggtggttgg tcacgtctgc tgtggaaacg caacgcttaa 


600 


cgttgcgcgt 


gcagtgggta 


cgggctgact agagtgcagt aggggagtct ggaattcctg 


660 


gtgtagcggt 


gaaatgcgca 


gatatcagga ggaacaccgg tggcgaaggc gggactctgg 


720 


gctgtaactg 


acactgagga 


gcgaaagcat ggggagcgaa caggattaga taccctggta 


780 


gtccatgccg 


taaacgttgg 


gcactaggtg tgggggacat tccacgttct ccgcgccgta 


840 


gctaacgcat 


taagtgcccc 


gcctggggag tacggtcgca aggctaaaac tcaaaggaat 


900 


tgacgggggc 


ccgcacaagc 


ggcggagcat gcggattaat tcgatgcaac gcgaagaacc 


9S0 


ttaccaaggc 


ttgacataca 


ctggaccgtt ctggaaacag ttcttctctt tggagctggt 


1020 


gtacaggtgg 


tgcatggttg 


tcgtcagctc gtgtcgtgag atgttgggtt aagtcccgca 


1080 


acgagcgcaa 


ccctcgttct 


atgttgccag cacgtgatgg tgggaactca taggagactg 


1140 


ccggggtcaa 


ctcggaggaa 


ggtggggatg acgtcaaatc atcatgccct ttatgtcttg 


1200 


ggcttcacgc 


atgctacaat 


ggctggtaca gagagaggcg aacccgtgag ggtgagcgaa 


1260 


tcccttaaag 


ccagtctcag 


ttcggatcgt agtctgcaat tcgactacgt gaagtcggag 


1320 


tcgctagtaa 


tcgcagatca 


gcaacgctgc ggtgaatacg ttcccgggcc ttgtacacac 


13B0 


cgcccgta 






1388 



<210> 5 
<211> 895 
<212> DNA 

<213> Brachyraonas sp. CHX 
<400> 5 

taggctaact acttctggca gaacccgctc ccatggtgtg acgggcggtg tgtacaagac 60 
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ccgggaacgt 


attcaccgcg 


acatgctgat 


ccgcgattac 


tagcgattcc gacttcacgc 


120 


agtcgagttg 


cagactgcga 


tccggactac 


gaccggcttt 


gtgggattgg ctccccctcg 


180 


cgggttggct 


accctctgta 


ccggccattg 


tatgacgtgt 


gtagccccac ctataagggc 


240 


catgaggact 


tgacgtcatc 


cccaccttcc 


tccggtttgt 


caccggcagt cccattagag 


300 


tgccctttcg 


tagcaactaa 


tggcaagggt 


tgcgctcgtt 


gcgggactta acccaacatc 


360 


tcacgacacg 


agctgacgac 


agccatgcag 


cacctgtgtg 


caggttctct ttcgagcact 


420 


cccaaatctc 


ttcaggattc 


ctgccatgtc 


aaaggtgggt 


aaggtttttc gcgttgcatc 


480 


gaattaaacc 


acatcatcca 


ccgcttgtgc 


gggtccccgt 


caattccttt gagtttcaac 


540 


cttgcggccg 


tactccccag 


gcggtcaact 


tcacgcgttg 


gcttcgttac tgagtcagct 


600 


aagacccaac 


aaccagttga 


catcgtttag 


ggcgtggact 


accagggtat ctaatcctgt 


660 


ttgctcccca 


cgctttcgtg 


catgagcgtc 


agtgcaggcc 


caggggattg ccttcgccat 


720 


cggtgttcct 


ccgcatatct 


acgcatttca 


ctgctacacg 


cggaattcca tccccctctg 


780 


ccgcactcca 


gctttgcagt 


cacaaaggca 


gttcccaggt 


tgagcccggg gatttcacct 


840 


ctgtcttaca 


aaaccgcctg 


cgcacgcttt 


acgcccagta 


attccgatga acgct 


895 



<210> 6 

<211> 1439 

<212> DNA 

<213> Rhodococcus erythropolis AN12 



<220> 

<221> misc_feature 

<222> (1417) .. (1417) 

<223> N = G or A or T or C 



<400> 6 

aaaacgctgg gcgggcgttg cttaacacat gcaattcgag cggtaaggcc tttcggggta 60 

cacaagcggc gaacgggtga gtaacacgtg ggtgatctgc cctgcacttc gggataagcc 120 

tgggaaactg ggtctaatac cggatatgac ctcaggtcgc atgacttggg gtggaaaaat 180 

ttatcggtgc aggatgggcc cgcggcctat cagcttgttg gtggggtaat ggcctaccaa 240 

ggcgacaacg ggtacccgac ctgaaagggt gaccggccac actgggactg aaacacggcc 300 
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caaactccta 


cgggaggcag 


cagtggggaa 


tattgcacaa 


tgggcgaaag 


cctgatgcac 


360 


cgaccccgcg 


tgagggatga 


cggccttcgg 


gttgtaaacc 


tctttcagca 


gggacaaacg 


420 


caagtgacgg 


tacctgcaga 


agaagccccg 


gctaactacg 


tgccagcagc 


cgcggtatta 


480 


cttagggtgc 


aagcgttgtc 


cggaattact 


gggcgtaaag 


agttcgtacg 


cggtttgtcg 


540 


cgtcgtttgt 


gaaaaccagc 


agctcaactg 


ctggcttgca 


ggcgatacgg 


gcagacttga 


500 


gtactgcagg 


ggagactgga 


attcctggtg 


tagcggtgaa 


atgcgcagat 


atcaggagga 


660 


acaccggtgg 


cgaaggcggg 


tctctgggca 


ctaactgacg 


ctgaggaacg 


aaagcgtggg 


720 


tagcgaacag 


gattacatac 


cctggtagtc 


cacgccgtaa 


acggtgggcg 


ctaggtgtgg 


780 


gttccttcca 


cggaatccgt 


gccgtagcta 


acgcattaag 


cgccccgcct 


ggggagtacg 


840 


gccgcaaggc 


taaaactcaa 


aggaattgac 


gggggcccgc 


acaatcggcg 


gaacatgtgg 


900 


attaattcga 


tgcaaegcga 


agaaccttac 


tgggtttgac 


atataccgga 


aagctgcaga 


960 


gatgtggccc 


cctttgtggt 


cggtatacag 


gtggtgcatg 


gctgtcgtca 


gctcgtgtcg 


1020 


tgagatgttg 


ggttaagtcc 


cgcaacgagc 


gcaaccccta 


tcttatgttg 


ccagcacgtt 


1080 


atggtgggga 


ctcgtaagag 


actgccgggg 


tcaactcgga 


ggaaggtggg 


gacgacgtca 


1140 


agtcatcatg 


ccccttatgt 


ccagggcttc 


acacatgcta 


caatggccag 


tacagagggc 


1200 


tgcgagaccg 


tgaggtggag 








tcggggtctg 




caactcgacc 


ccgtgaagtc 


ggagtcgcta 


gtaatcgcag 


atcagcaacg 


ctgcggtgaa 


1320 


tacgttcccg 


ggccttgtac 


acaccgcccg 


tcacgtcatg 


aaagtcggta 


acacccgaag 


1380 


ccggtggctt 


aaccccttgt 


gcgaggagcc 


gtcgaangtg 


ggatcggcga 


ttgggcgcc 


1439 


<210> 7 














<211> 1626 












<212> DNA 














<213> Rhodococcus sp 


. phil 










<400> 7 

atgactgcac agatctcacc 


cacagttgtc 


gacgccgttg 


tcatcggcgc 


cggattcggc 


60 


ggcatctacg ccgtgcacaa 


gctgcacaac 


gaacagggcc 


tgaccgtggt 


cggtttcgac 


120 


aaggcggacg gccccggcgg 


tacctggtac 


tggaaccgct 


acccgggagc 


gctctccgac 


180 


accgagagtc 


atctctaccg 


cttctcgttc 


gaccgcgacc 


tgctgcagga 


cggcacgtgg 


240 


aagaccacgt acatcaccca 


gcccgagatc 


ctcgagtatc 


tcgagagcgt 


cgtcgaccgg 


300 


ttcgacctgc gtcgtcactt 


ccggttcggc 


accgaggtca 


cctcggcgat 


ctacctcgag 


360 
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gacgagaacc tgtgggaggt ctccaccgac aagggtgagg tctaccgggc caagtacgtc 420 

gtcaacgccg tgggcctgct ctccgccatc aacttccccg acctccccgg cctcgacacc 480 

ttcgagggcg agaccatcca caccgccgcc tggcccgagg gcaagaacct cgccggcaag 540 

cgtgtcggtg tcatcggtac cggatcgacc gggcagcagg tcatcaccgc cctcgcgccg 600 

gaggtcgagc acctcaccgt cttcgtccgc accccgcagt actccgtgcc ggtcggcaac 660 

cgtcccgtga cgaaggaaca gatcgacgcg atcaaggccg actacgacgg tatctgggac 720 

agcgtcaaga agtccgcggt ggccttcggg ttcgaggagt ccaccctgcc tgccatgtcc 780 

gtctcggaag aggagcgcaa ccgcatcttc caggaggcgt gggaccacgg cggcggcttc 840 

cgcttcatgt tcggcacctt cggcgacatc gccaccgacg aggccgccaa cgaagctgcg 900 

gcatcgttca tccgctccaa gatcgccgag atcatcgagg atccggaaac ggcccgcaag 960 

ctgatgccga ccggtctgta cgccaagcgt ccgctgtgcg acaacggcta ctacgaggtg 1020 

tacaaccgcc cgaacgtcga ggccgtcgcg atcaaggaga accccatccg tgaggtcacc 1080 

gccaagggcg tcgtgaccga ggacggtgtc ctccacgaac tcgacgtgct cgtcttcgcc 1140 

accggcttcg acgccgtcga cggcaactac cgccggatcg agatccgcgg ccggaacggc 1200 

ctgcacatca acgaccactg ggacggccag ccgacgagct acctcggcgt caccaccgcg 1260 

aacttcocca actggttcat ggtgctcggt cccaacggcc cgttcacaaa cctgccgccg 1320 

agcatcgaaa cgcaggtcga gtggatcagc gacaccgtcg cctacgccga gcgcaacgag 1380 

atccgtgcga tcgaacccac cccggaggcc gaggaggagt ggacgcagac ctgcaccgac 1440 

atcgcgaacg ccacgctgtt cacccgcggt gactcctgga tcttcggcgc gaatgttccg 1500 

ggcaagaagc cgagcgtcct gttctacctg ggcggactgg gcaactaccg caacgtcctc 1560 

gcgggtgtcg tcgccgacag ctaccgaggt ttcgagttga agtccgctgt cccggtgacc 1620 

gcctga 1626 

<210> 8 
<211> 542 
<212> PRT 

<213> Rhodococcus sp. phil 



<400> 8 

Met Thr Ala Gin He Ser Pro Thr Val Val Asp Ala Val Val He Gly 
15 10 is 
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Ala Gly Phe Gly Gly lie Tyr Ala Val His Lys Leu His Asn Glu Gin 
20 25 30 

Gly Leu Thr Val Val Gly Phe Asp Lys Ala Asp Gly Pro Gly Gly Thr 
35 40 45 

Trp Tyr Trp Asn Arg Tyr Pro Gly Ala Leu Ser Asp Thr Glu Ser His 
50 55 60 

Leu Tyr Arg Phe Ser Phe Asp Arg Asp Leu Leu Gin Asp Gly Thr Trp 
65 70 75 80 

Lys Thr Thr Tyr lie Thr Gin Pro Glu He Leu Glu Tyr Leu Glu Ser 



Val Val Asp Arg Phe Asp Leu Arg Arg His Phe Arg Phe Gly Thr Glu 
100 105 HO 



Val Thr Ser Ala He Tyr Leu Glu Asp Glu Asn Leu Trp Glu Val Ser 
115 120 125 



Thr Asp Lys Gly Glu Val Tyr Arg Ala Lys Tyr Val Val Asn Ala Val 
130 135 140 



Gly Leu Leu Ser Ala He Asn Phe Pro Asp Leu Pro Gly Leu Asp Thr 
145 150 155 ISO 



Phe Glu Gly Glu Thr He His Thr Ala Ala Trp Pro Glu Gly Lys Asn 
165 170 175 



Leu Ala Gly Lys Arg Val Gly Val He Gly Thr Gly Ser Thr Gly Gin 
180 ' 185 190 



Gin Val He Thr Ala Leu Ala Pro Glu Val Glu His Leu Thr Val Phe 
195 200 205 



Val Arg Thr Pro Gin Tyr Ser Val Pro Val Gly Asn Arg Pro Val Thr 
210 215 220 



Lys Glu Gin He Asp Ala He Lys Ala Asp Tyr Asp Gly He Trp Asp 
225 230 235 240 



Ser Val Lys Lys Ser Ala Val Ala Phe Gly Phe Glu Glu Ser Thr Leu 
245 250 255 
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Pro Ala Met Ser Val Ser Glu Glu Glu Arg Asn Arg He Phe Gin Glu 
260 265 270 



Ala Tip Asp His Gly Gly Gly Phe Arg Phe Met Phe Gly Thr Phe Gly 
275 280 285 



Asp He Ala Thr Asp Glu Ala Ala Asn Glu Ala Ala Ala Ser Phe He 
290 295 300 



Arg Ser Lys He Ala Glu He He Glu Asp Pro Glu Thr Ala Arg Lys 
305 310 315 320 



Leu Met Pro Thr Gly Leu Tyr Ala Lys Arg Pro Leu Cys Asp Asn Gly 
325 330 335 



Tyr Tyr Glu Val Tyr Asn Arg Pro Asn Val Glu Ala Val Ala He Lys 
340 345 350 



Glu Asn Pro He Arg Glu Val Thr Ala Lys Gly Val Val Thr Glu Asp 
355 360 365 



Gly Val Leu His Glu Leu Asp Val Leu Val Phe Ala Thr Gly Phe Asp 
370 375 380 



Ala Val Asp Gly Asn Tyr Arg Arg He Glu He Arg Gly Arg Asn Gly 
385 390 395 400 



Leu His He Asn Asp His Trp Asp Gly Gin Pro Thr Ser Tyr Leu Gly 
405 410 415 



Val Thr Thr Ala Asn Phe Pro Asn Trp Phe Met Val Leu Gly Pro Asn 
420 425 430 



Gly Pro Phe Thr Asn Leu Pro Pro Ser lie Glu Thr Gin Val Glu Trp 
435 440 445 



He Ser Asp Thr Val Ala Tyr Ala Glu Arg Asn Glu He Arg Ala He 
450 455 460 



Glu Pro Thr Pro Glu Ala Glu Glu Glu Trp Thr Gin Thr Cys Thr Asp 
465 470 475 480 



He Ala Asn Ala Thr Leu Phe Thr Arg Gly Asp Ser Trp He Phe Gly 
485 490 495 



Ala Asn Val Pro Gly Lys Lys Pro Ser Val Leu Phe Tyr Leu Gly Gly 
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500 505 510 

Leu Gly Asn Tyr Arg Asn Val Leu Ala Gly Val Val Ala Asp Ser Tyr 
515 520 525 

Arg Gly Phe Glu Leu Lys Ser Ala Val Pro Val Thr Ala Glx 
530 535 540 

<210> 9 
<211> 1623 
<212> DNA 

<213> Rhodococcus sp. phi2 
<400> 9 

atgaccgcac agaccatcca caccgtcgac gccgtcgtca tcggcgccgg attcggcggc 60 

. atctacgccg tccacaagct gcaccacgaa ctcggcctga ccaccgtcgg attcgacaag 120 

gcagacggcc ccggcggcac ctggtactgg aaccgctacc cgggcgccct ctccgacacg 180 

gagagccacc tctaccgctt ctccttcgac cgcgacctgc tgcaggacgg cacctggaag 240 

aacacgtacg tcacccagcc cgagatcctg gagtatctcg aggacgtcgt cgaccgcttc 300 

gacctgcgcc gccacttccg gttcggcacc gaggtcacct cggcgatcta tctcgacgac 360 

gagaacctct gggaggtcac caccgacggc ggcgacgtct atcgggcgac ctacgtcgtc 42 0 

aacgccgtcg ggctgctctc cgccatcaac ttcccgaacc tgcccggcct ggacacgttc 480 

gagggcgaga ccatccacac cgccgcctgg ccggagggca agagcctcgc cgggcgccgc 540 

gtcggcgtca tcggtaccgg ttccaccggc cagcaggtca tcacggcgct ggcgccggag 600 

gtcgagcacc tcaccgtctt cgtccggacc ccgcagtact ccgtaccggt cggcaaccgt 660 

cccgtgaccc cggagcagat cgacgcgatc aaggccgact acgaccgaat ctgggagcag 720 

gccaagaact ccgcggtggc cttcggcttc gaggagtcca ccctgccggc catgtccgtc 780 

tcggaggagg agcgcaaccg gatcttccag gaggcctggg accacggcgg cggattccgt 840 

ttcatgttcg gcaccttcgg tgacatcgcc accgacgagg ccgccaacga agccgccgcg 900 

tcgttcatcc gctccaagat cgccgagatc atcgaggatc cggagaccgc ccgcaagctg 960 

atgccgaccg gtctgttcgc caagcgcccg ctgtgcgacg ccggctacca ccaggtcttc 1020 

aaccggccga acgtggaagc ggttgccatc aaggagaacc ccatccgcga ggtcaccgcg 1080 

aagggcgtgg tgaccgagga cggcgtcctg cacgagttgg acgtgctcgt cttcgccacc 1140 

ggcttcgacg ccgtggacgg caactaccgg cgcatcgaga tccgcggccg ggacggcctg 1200 
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cacatcaacg accactggga 


cggccagccg accagctacc tgggcgtctc cacggcgaac 


1260 


ttccccaact ggttcatggt 


gctgggcccc aacggtccgt tcacgaacct gcccccgagc 


1320 


atcgagaccc aggtcgagtg 


gatcagcgac acgatcgggt acgccgagcg caacggtgtg 


1380 


cgggccatcg agcccacgcc 


ggaggccgag gccgaatgga ccgagacctg caccgcgatc 


1440 


gcgaacgcca cgctgttcac 


caagggcgat tcgtggatct tcggcgcgaa catcccgggc 


1500 


aagacgccga gcgtactgtt 


ctacctgggc ggcctgcgca actaccgtgc cgtcctcgcc 


1560 


gaggtcgcga ccgacggata 


ccggggcfctc ga.cgtga.agfc ccgccgagafc ggfccacggfcc 


1520 


tga 




1623 


<210> 10 






<211> 541 






<212> PRT 






<213> Rhodococcus sp 


. phi 2 





<400> 10 

Met Thr Ala Gin Thr He His Thr Val Asp Ala Val Val He Gly Ala 
15 10 15 

Gly Phe Gly Gly He Tyr Ala Val His Lys Leu His His Glu Leu Gly 
20 25 30 

Leu Thr Thr Val Gly Phe Asp Lys Ala Asp Gly Pro Gly Gly Thr Trp 
35 40 45 

Tyr Trp Asn Arg Tyr Pro Gly Ala Leu Ser Asp Thr Glu Ser His Leu 
50 55 60 

Tyr Arg Phe Ser Phe Asp Arg Aap Leu Leu Gin Asp Gly Thr Trp Lys 
65 70 75 80 

Asn Thr Tyr Val Thr Gin Pro Glu He Leu Glu Tyr Leu Glu Asp Val 
85 90 95 

Val Asp Arg Phe Asp Leu Arg Arg His Phe Arg Phe Gly Thr Glu Val 
100 105 110 

Thr Ser Ala He Tyr Leu Asp Asp Glu Asn Leu Trp Glu Val Thr Thr 
115 120 125 
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Asp Gly Gly Asp Val Tyr Arg Ala Thr Tyr Val Val Asn Ala Val Gly 
130 ~ 135 140 



Leu Leu Ser Ala He Asn Phe Pro Asn Leu Pro Gly Leu Asp Thr Phe 
145 150 155 160 



Glu Gly Glu Thr He His Thr Ala Ala Trp Pro Glu Gly Lys Ser Leu 
165 170 175 



Ala Gly Arg Arg Val Gly Val He Gly Thr Gly Ser Thr Gly Gin Gin 
180 185 190 



Val He Thr Ala Leu Ala Pro Glu Val Glu His Leu Thr Val Phe Val 
195 200 205 

Arg Thr Pro Gin Tyr Ser Val Pro Val Gly Asn Arg Pro Val Thr Pro 
210 215 220 



Glu Gin He Asp Ala He Lys Ala Asp Tyr Asp Arg He Trp Glu Gin 
225 230 235 240 



Ala Lys Asn Ser Ala Val Ala Phe Gly Phe Glu Glu Ser Thr Leu Pro 
245 250 255 



Ala Met Ser Val Ser Glu Glu Glu Arg Asn Arg He Phe Gin Glu Ala 
260 265 270 



Trp Asp His Gly Gly Gly Phe Arg Phe Met Phe Gly Thr Phe Gly Asp 
275 280 285 



He Ala Thr Asp Glu Ala Ala Asn Glu Ala Ala Ala Ser Phe He Arg 
290 295 300 



Ser Lys He Ala Glu He He Glu Asp Pro Glu Thr Ala Arg Lys Leu 
305 310 315 320 



Met Pro Thr Gly Leu Phe Ala Lys Arg Pro Leu Cys Asp Ala Gly Tyr 
325 330 335 



His Gin Val Phe Asn Arg Pro Asn Val Glu Ala Val Ala He Lys Glu 
340 345 350 



Asn Pro He Arg Glu Val Thr Ala Lys Gly Val Val Thr Glu Asp Gly 
355 360 365 
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Val Leu His Glu Leu Asp Val Leu Val Phe Ala Thr Gly Phe Asp Ala 
370 375 380 



Val Asp Gly Asn Tyr Arg Arg He Glu He Arg Gly Arg Asp Gly Leu 
385 390 395 400 



His lie Asn Asp His Trp Asp Gly Gin Pro Thr Ser Tyr Leu Gly Val 
405 410 415 



Ser Thr Ala Asn Phe Pro Asn Trp Phe Met Val Leu Gly Pro Asn Gly 

420 425 430 



Pro Phe Thr Asn Leu Pro Pro Ser He Glu Thr Gin Val Glu Trp He 
435 440 445 



Ser Asp Thr He Gly Tyr Ala Glu Arg Asn Gly Val Arg Ala He Glu 
450 455 460 



Pro Thr Pro Glu Ala Glu Ala Glu Trp Thr Glu Thr Cys Thr Ala He 
465 470 475 480 



Ala Asn Ala Thr Leu Phe Thr Lys Gly Asp Ser Trp He Phe Gly Ala 
485 490 495 



Asn He Pro Gly Lys Thr Pro Ser Val Leu Phe Tyr Leu Gly Gly Leu 
500 505 510 



Gly Phe Asp Val Lys Ser Ala Glu Met Val Thr Val Glx 
530 535 540 



<210> 11 
<211> 1596 
<212> DNA 

<213> Arthrobacter sp. BP2 
<400> 11 

atgactgcac agaacacttt ccagaccgtt gacgccgtcg tcatcggcgc cggcttcggc 60 
ggcatctacg ccgtccacaa gcttcacaac gagcagggtc tgaccgttgt cggcttcgac 120 
aaggccgacg gtcccggcgg cacctggtac tggaaccgct acccgggcgc tctctctgac 180 
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accgagagcc acgtctaccg cttctctttc gataagggcc tcctgcagga cggcacctgg 240 

aagcacacct acatcaccca gcccgagatc ctcgagtacc ttgaggacgt cgttgaccgc 3 00 

tttgacctgc gg'cgccactt ccgctttggt accgaggtca agtccgccac ctacctcgaa 360 

gacgagggcc tgtgggaagt gaccaccggc ggcggcgcgg tgtaccgggc taagtacgtc 420 

atcaacgccg tggggctgct gtcagccatc aacttcccga acctgcccgg gatcgacacc 480 

tttgagggcg agaccatcca caccgccgcc tggccgcagg gcaagtccct cgccggtcgc 54 0 

cgcgtgggtg tgatcggcac cggttccacc ggccagcagg tcatcacggc gctggcaccg 600 

gaagttgaac acctgaccgt cttcgtcagg accccgcagt actccgtccc ggtgggcaag 660 

cgccccgtga ccacccagca gattgacgag atcaaggccg actacgacaa catctgggca 720 

caggtcaagc gttccggcgt agccttcggc ttcgaggaaa gcaccgtgcc ggccatgagc 780 

gtcaccgaag aagaacgccg ccaggtctac gagaaggcct gggaatacgg cggcggcttc 840 

cgcttcatgt tcgaaacctt cagcgacatc gccaccgacg aggaggccaa cgagactgcg 900 

gcatccttca tccggaacaa gatcgtcgag accatcaagg atccggagac ggcacggaaa 960 

ctgacgccga cgggcttgtt cgcccgtcgc ccgctctgcg acgacggctt acttccaggt 1020 

gttcaaccgg cccaacgtcg aggctgtcgc tatcaaggaa aaccccattc gggaagtcac 1080 

ggccaagggt gtggtgacgg aggacggcgt gctgcacgag ctggacgtca tcgtcttcgc 1140 

gaocggtttc gacgccgtgg acggcaatta ccgccgcatg gagatcagcg ggcgcgacgg 1200 

cgtgaacatc aacgaccact gggacgggca gcccaccagc tacctgggcg tttccacagc 1260 

gaagttcccc aactggttca tggtgctggg acccaacggc ccgttcacga acctgccgcc 1320 

gagcatcgag acgcaggtcg aatggatcag cgacacggtg gcctacgcgg aggaaaacgg 1380 

aatccgggcg atcgagccga ccccggaggc cgaagccgag tggaccgaga cgtgcacaca 1440 

gatcgcgaac atgacggtgt tcaccaaggt cgattcatgg atcttcggcg cgaacgttcc 1500 

gggcaagaag cccagcgtgc tgttctatct gggcggcctg ggcaactacc gcggcgtcct 1560 

ggacgatgtc accgacaacg gataccgcgg ctttga 1596 

<210> 12 
<211> 532 
<212> PRT 

<213> Arthrobacter sp. BP2 
<400> 12 
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Met Thr Ala Gin Asn Thr Phe Gin Thr Val Asp Ala Val Val lie Gly 
15 10 15 

Ala Gly Phe Gly Gly. lie Tyr Ala Val His Lya Leu His Asn Glu Gin 
20 25 30 

Gly Leu Thr Val Val Gly Phe Asp Lys Ala Asp Gly Pro Gly Gly Thr 
35 40 45 

Trp Tyr Trp Asn Arg Tyr Pro Gly Ala Leu Ser Asp Thr Glu Ser His 
50 55 60 

Val Tyr Arg Phe Ser Phe Asp Lys Gly Leu Leu Gin Asp Gly Thr Trp 
65 70 75 80 

Lys His Thr Tyr lie Thr Gin Pro Glu lie Leu Glu Tyr Leu Glu Asp 



Val Val Asp Arg Phe Asp Leu Arg Arg His Phe Arg Phe Gly Thr Glu 
100 105 HO 



Val Lys Ser Ala Thr Tyr Leu Glu Asp Glu Gly Leu Trp Glu Val Thr 
115 120 125 



Thr Gly Gly Gly Ala Val Tyr Arg Ala Lys Tyr Val lie Asn Ala Val 
130 135 140 



Gly Leu Leu Ser Ala He Asn Phe Pro Asn Leu Pro Gly He Asp Thr 
145 150 155 160 



Phe Glu Gly Glu Thr He His Thr Ala Ala Trp Pro Gin Gly Lys Ser 
165 170 175 



Leu Ala Gly Arg Arg Val Gly Val He Gly Thr Gly Ser Thr Gly Gin 
180 185 190 



Gin Val He Thr Ala Leu Ala Pro Glu Val Glu His Leu Thr Val Phe 
195 200 205 



Val Arg Thr Pro Gin Tyr Ser Val Pro Val Gly Lys Arg Pro Val Thr 
210 215 220 



Thr Gin Gin He Asp Glu He Lys Ala Asp Tyr Asp Asn He Trp Ala 
225 230 235 240 



Gin Val Lys Arg Ser Gly Val Ala Phe Gly Phe Glu Glu Ser Thr Val 
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Pro Ala Met Ser Val Thr Glu Glu Glu Arg Arg Gin Val Tyr Glu Lys 
260 265 270 



Ala Trp Glu Tyr Gly Gly Gly Phe Arg Phe Met Phe Glu Thr Phe Ser 
275 280 285 



Asp He Ala Thr Asp Glu Glu Ala Asn Glu Thr Ala Ala Ser Phe He 
290 295 300 



Arg Asn Lys He Val Glu Thr He Lys Asp Pro Glu Thr Ala Arg Lys 
305 310 315 320 

Leu Thr Pro Thr Gly Leu Phe Ala Arg Arg Pro Leu Cys Asp Asp Gly 
325 330 335 



Leu Leu Pro Gly Val Gin Pro Ala Gin Arg Arg Gly Cys Arg Tyr Gin 
340 345 350 



Gly Lys Pro His Ser Gly Ser His Gly Gin Gly Cys Gly Asp Gly Gly 
355 360 365 



Arg Arg Ala Ala Arg Ala Gly Arg His Arg Leu Arg Asp Arg Phe Arg 
370 375 380 



Arg Arg Gly Arg Gin Leu Pro Pro His Gly Asp Gin Arg Ala Arg Arg 
385 390 395 400 



Arg Glu His Gin Arg Pro Leu Gly Arg Ala Ala Hi3 Gin Leu Pro Gly 
405 410 415 



Arg Phe His Ser Glu Val Pro Gin Leu Val His Gly Ala Gly Thr Gin 
420 425 430 



Arg Pro Val His Glu Pro Ala Ala Glu His Arg Asp Ala Gly Arg Met 
435 440 445 



Asp Gin Arg His Gly Gly Leu Arg Gly Gly Lys Arg Asn Pro Gly Asp 
450 455 460 



Arg Ala Asp Pro Gly Gly Arg Ser Arg Val Asp Arg Asp Val His Thr 
465 " 470 475 480 



Asp Arg Glu His Asp Gly Val His Gin Gly Arg Phe Met Asp Leu Arg 
485 490 495 
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Arg Glu Arg Ser Gly Gin Glu Ala Gin Arg Ala Val Leu Ser Gly Arg 
500 505 510 

Pro Gly Gin Leu Pro Arg Arg Pro Gly Arg Cys His Arg Gin Arg lie 
515 520 525 

Pro Arg Leu Glx 
530 

<210> 13 

<211> 1662 

<212> DNA 

<213> Brevibacterium ap. HOI 



<220> 

<221> CDS 

<222> (1)..(1662) 

<223> 



<400> 13 

atg cca att aca caa caa ctt gac cac gac get ate gtc ate ggc gec 48 
Met Pro He Thr Gin Gin Leu Asp His Asp Ala He Val He Gly Ala 
15 10 15 

ggc ttc tec gga eta gee att ctg cac cac ctg cgt gaa ate ggc eta 96 
Gly Phe Ser Gly Leu Ala He Leu His His Leu Arg Glu He Gly Leu 
20 25 30 

gac act caa ate gtc gaa gca acc gac ggc att gga gga act tgg tgg 144 
Asp Thr Gin He Val Glu Ala Thr Asp Gly He Gly Gly Thr Trp Trp 
35 40 45 

ate aac cgc tac ccg ggg gtg egg acc gac age gag ttc cac tac tac 192 
He Asn Arg Tyr Pro Gly Val Arg Thr Asp Ser Glu Phe His Tyr Tyr 
50 55 60 

tct ttc age ttc age aag gaa gtt cgt gac gag tgg aca tgg act caa 240 
Ser Phe Ser Phe Ser Lys Glu Val Arg Asp Glu Trp Thr Trp Thr Gin 
65 70 75 80 

cgc tac cca gac ggt gaa gaa gtt tgc gee tat etc aat ttc att get 288 
Arg Tyr Pro Asp Gly Glu Glu Val Cys Ala Tyr Leu Asn Phe He Ala 
85 90 95 

gat cga ctt gat ctt egg aag gac att cag etc aac tea cga gtg aat 336 
Asp Arg Leu Asp Leu Arg Lys Asp He Gin Leu Asn Ser Arg Val Asn 
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act gcc cgt tgg aat gag acg gaa aag tac tgg gac gtc att ttc gaa 
Thr Ala Arg Trp Asn Glu Thr Glu Lys Tyr Trp Asp Val lie Phe Glu 
115 120 125 

gac ggg tec teg aaa cgc get cgc ttc etc ate age gca atg ggt gca 
Asp Gly Ser Ser Lys Arg Ala Arg Phe Leu lie Ser Ala Met Gly Ala 
130 135 140 

ctt age cag gcg att ttc ccg gcc ate gac gga ate gac gaa ttc aac 
Leu Ser Gin Ala lie Phe Pro Ala lie Asp Gly He Asp Glu Phe Asn 
145 150 155 160 

ggc gcg aaa tat cac act gcg get tgg cca get gat ggc gta gat ttc 
Gly Ala Lys Tyr His Thr Ala Ala Trp Pro Ala Asp Gly Val Asp Phe 
1S5 170 175 

acg ggc aag aag gtt gga gtc att ggg gtt ggg gcc teg gga att caa 
Thr Gly Lys Lys Val Gly Val He Gly Val Gly Ala Ser Gly He Gin 
180 185 190 

ate att ccc gag etc gcc aag ttg get ggc gaa eta ttc gta ttc cag 
He He Pro Glu Leu Ala Lys Leu Ala Gly Glu Leu Phe Val Phe Gin 
195 200 205 

cga act ccg aac tat gtg gtt gag age aac aac gac aaa gtt gac gcc 
Arg Thr Pro Asn Tyr Val Val Glu Ser Asn Asn Asp Lys Val Asp Ala 
210 215 220 

gag tgg atg cag tac gtt cgc gac aac tat gac gaa att ttc gaa cgc 
Glu Trp Met Gin Tyr Val Arg Asp Asn Tyr Asp Glu He Phe Glu Arg 
225 230 235 240 

gca tec aag cac ccg ttc ggg gtc gat atg gag tat ccg acg gat tec 
Ala Ser Lys His Pro Phe Gly Val Asp Met Glu Tyr Pro Thr Asp Ser 
245 250 255 

gcc gtc gag gtt tea gaa gaa gaa cgt aag cga gtc ttt gaa age aaa 
Ala Val Glu Val Ser Glu Glu Glu Arg Lys Arg Val Phe Glu Ser Lys 
260 265 270 

tgg gag gag gga ggc ttc cat ttt gca aac gag tgt ttc acg gac ctg 
Trp Glu Glu Gly Gly Phe His Phe Ala Asn Glu Cys Phe Thr Asp Leu 
275 280 285 

ggt acc agt cct gag gee age gag ctg gcg tea gag ttc ata cgt teg 
Gly Thr Ser Pro Glu Ala Ser Glu Leu Ala Ser Glu Phe He Arg Ser 
290 295 300 

aag att egg gag gtc gtt aag gac ccc get acg gca gat etc ctt tgt 
Lys He Arg Glu Val Val Lys Asp Pro Ala Thr Ala Asp Leu Leu Cys 
305 310 315 320 

ccc aag teg tac teg ttc aac ggt aag cga gtg ccg acc ggc cac ggc 
Pro Lys Ser Tyr Ser Phe Asn Gly Lys Arg Val Pro Thr Gly His Gly 
325 330 335 

tac tac gag acg ttc aat cgc acg aat gtg cac ctt ttg gat gcc agg 
Tyr Tyr Glu Thr Phe Asn Arg Thr Asn Val His Leu Leu Asp Ala Arg 
340 345 350 
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ggc act cca att act egg ate age age aaa ggt ate gtt cac gga gac 
Gly Thr Pro lie Thr Arg He Ser Ser Lys Gly He Val His Gly Asp 
355 360 365 

acc gaa tac gaa eta gat gca ate gtg ttc gca ace ggc ttc gac gcg 
Thr Glu Tyr Glu Leu Asp Ala He Val Phe Ala Thr Gly Phe Asp Ala 
370 375 380 

atg aca ggt acg etc acc aac att gac ate gtc ggc cgc gac gga gtc 
Met Thr Gly Thr Leu Thr Asn He Asp He Val Gly Arg Asp Gly Val 
385 390 395 400 

ate etc cgc gac aag tgg gee cag gat ggg ctt agg aca aac att ggt 
He Leu Arg Asp Lys Trp Ala Gin Asp Gly Leu Arg Thr Asn He Gly 
405 410 415 

ctt act gta aac ggc ttc ccg aac ttc ctg atg tct ctt gga cct cag 
Leu Thr Val Asn Gly Phe Pro Asn Phe Leu Met Ser Leu Gly Pro Gin 
420 425 430 

acc ccg tac tec aac ctt gtt gtt cct att cag ttg gga gee caa tgg 
Thr Pro Tyr Ser Asn Leu Val Val Pro He Gin Leu Gly Ala Gin Trp 
435 440 445 

atg cag cga ttc ctt aag ttc att cag gaa cgc ggc att gaa gtg ttc 
Met Gin Arg Phe Leu Lys Phe He Gin Glu Arg Gly He Glu Val Phe 
450 455 460 

gag teg teg aga gaa get gaa gaa ate tgg aat gec gaa acc att cgc 
Glu Ser Ser Arg Glu Ala Glu Glu He Trp Asn Ala Glu Thr He Arg 
465 470 475 480 

ggc get gaa tct acg gtc atg tec ate gaa gga ccc aaa gee ggc gca 
Gly Ala Glu Ser Thr Val Met Ser He Glu Gly Pro Lys Ala Gly Ala 
485 490 495 

tgg ttc ate ggc ggc aac att ccc ggt aaa tea cgt gag tac cag gtg 
Trp Phe He Gly Gly Asn He Pro Gly Lys Ser Arg Glu Tyr Gin Val 
500 505 510 

tat atg ggc ggc ggt cag gtc tac cag gac tgg tgc cgc gag gcg gaa 
Tyr Met Gly Gly Gly Gin Val Tyr Gin Asp Trp Cys Arg Glu Ala Glu 
515 520 525 

gaa tec gac tac gec act ttt ctg aat get gac tec att gac ggc gaa 
Glu Ser Asp Tyr Ala Thr Phe Leu Asn Ala Asp Ser He Asp Gly Glu 
530 535 540 



aag gtt cgt gaa teg gcg ggt atg aaa tag 
Lys Val Arg Glu Ser Ala Gly Met Lys 
545 550 



<210> 14 
<211> 553 
<212> PRT 
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<213> Brevibacterium sp. HCU 
<400> 14 

Met Pro He Thr Gin Gin Leu Asp His Asp Ala He Val He Gly Ala 
15 10 15 

Gly Phe Ser Gly Leu Ala He Leu His His Leu Arg Glu He Gly Leu 
20 25 30 

Asp Thr Gin He Val Glu Ala Thr Asp Gly He Gly Gly Thr Trp Trp 
35 40 45 

He Asn Arg Tyr Pro Gly Val Arg Thr Asp Ser Glu Phe His Tyr Tyr 
50 55 60 

Ser Phe Ser Phe Ser Lys Glu Val Arg Asp Glu Trp Thr Trp Thr Gin 
65 70 75 " 80 

Arg Tyr Pro Asp Gly Glu Glu Val Cys Ala Tyr Leu Asn Phe He Ala 



Asp Arg Leu Asp Leu Arg Lys Asp He Gin Leu Asn Ser Arg Val Asn 
100 105 110 



Thr Ala Arg Trp Asn Glu Thr Glu Lys Tyr Trp Asp Val He Phe Glu 
115 120 125 



Asp Gly Ser Ser Lys Arg Ala Arg Phe Leu He Ser Ala Met Gly Ala 
130 135 140 



Leu Ser Gin Ala He Phe Pro Ala He Asp Gly He Asp Glu Phe Asn 
145 150 155 160 



Gly Ala Lys Tyr His Thr Ala Ala Trp Pro Ala Asp Gly Val Asp Phe 
165 170 175 



Thr Gly Lys Lys Val Gly Val He Gly Val Gly Ala Ser Gly He Gin 
180 185 190 



He He Pro Glu Leu Ala Lys Leu Ala Gly Glu Leu Phe Val Phe Gin 
195 200 205 



Arg Thr Pro Asn Tyr Val Val Glu Ser Asn Asn Asp Lys Val Asp Ala 
210 215 220 
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Ala Ser Lys His Pro Phe Gly Val Asp Met Glu Tyr Pro Thr Asp Ser 
245 250 255 



Ala Val Glu Val Ser Glu Glu Glu Arg Lys Arg Val Phe Glu Ser Lys 
260 265 270 



Trp Glu Glu Gly Gly Phe His Phe Ala Asn Glu Cys Phe Thr Asp Leu 
275 280 285 



Gly Thr Ser Pro Glu Ala Ser Glu Leu Ala Ser Glu Phe lie Arg Ser 
290 295 ' 300 



Lys lie Arg Glu Val Val Lys Asp Pro Ala Thr Ala Asp Leu Leu Cys 
305 310 315 320 



Pro Lys Ser Tyr Ser Phe Asn Gly Lys Arg Val Pro Thr Gly His Gly 
325 330 335 



Tyr Tyr Glu Thr Phe Asn Arg Thr Asn Val His Leu Leu Asp Ala Arg 
340 345 350 



Gly Thr Pro lie Thr Arg lie Ser Ser Lys Gly lie Val His Gly Asp 
355 360 365 



Thr Glu Tyr Glu Leu Asp Ala lie Val Phe Ala Thr Gly Phe Asp Ala 
370 375 380 



Met Thr Gly Thr Leu Thr Asn He Asp He Val Gly Arg Asp Gly Val 
385 390 395 400 



He Leu Arg Asp Lys Trp Ala Gin Asp Gly Leu Arg Thr Asn He Gly 
405 410 415 



Leu Thr Val Asn Gly Phe Pro Asn Phe Leu Met Ser Leu Gly Pro Gin 
420 " 425 430 



Thr Pro Tyr Ser Asn Leu Val Val Pro He Gin Leu Gly Ala Gin Trp 
435 440 445 



Met Gin Arg Phe Leu Lys Phe He Gin Glu Arg Gly He Glu Val Phe 
450 455 460 
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Glu Ser Ser Arg Glu Ala Glu Glu lie Trp Asn Ala Glu Thr lie Arg 
465 470 475 480 

Gly Ala Glu Ser Thr Val Met Ser lie Glu Gly Pro Lye Ala Gly Ala 
485 490 495 

Trp Phe He Gly Gly Asn He Pro Gly Lys Ser Arg Glu Tyr Gin Val 
500 505 510 

Tyr Met Gly Gly Gly Gin Val Tyr Gin Asp Trp Cys Arg Glu Ala Glu 
515 520 525 

Glu Ser Asp Tyr Ala Thr Phe Leu Asn Ala Asp Ser He Asp Gly Glu 
530 535 540 

Lys Val Arg Glu Ser Ala Gly Met Lys 
545 550 

<210> 15 

<211> 1590 

<212> DNA 

<213> Brevibacterium sp. HOT 
<220> 

<221> CDS 

<222> (1)..(1590) 

<223> 



<400> 15 

atg acg tea acc atg cct gca ccg aca gca gca cag gcg aac gca gac 
Met Thr Ser Thr Met Pro Ala Pro Thr Ala Ala Gin Ala Asn Ala Asp 



gag acc gag gtc etc gac gca etc ate gtg ggt ggc gga ttc teg ggg 
Glu Thr Glu Val Leu Asp Ala Leu He Val Gly Gly Gly Phe Ser Gly 



cct gta tct gtc gac cgc ctg cgt gaa gac ggg ttc aag gtc aag gtc 
Pro Val Ser Val Asp Arg Leu Arg Glu Asp Gly Phe Lys Val Lys Val 



tgg gac gec gec ggc gga ttc ggc ggc ate tgg tgg tgg aac tgc tac 
Trp Asp Ala Ala Gly Gly Phe Gly Gly He Trp Trp Trp Asn Cys Tyr 
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ccg ggt get cgt acg gac age acc gga cag ate tat cag ttc cag tac 
Pro Gly Ala Arg Thr Asp Ser Thr Gly Gin lie Tyr Gin Phe Gin Tyr 



aag gac ctg tgg aag gac ttc gac ttc aag gag etc tac ccc gac ttc 
Lya Asp Leu Trp Lys Asp Phe Asp Phe Lys Glu Leu Tyr Pro Asp Phe 



aac ggg gtt egg gag tac ttc gag tac gtc gac teg cag etc gac ctg 
Asn Gly Val Arg Glu Tyr Phe Glu Tyr Val Asp Ser Gin Leu Asp Leu 
100 105 110 

tec cgc gac gtc aca ttc aac acc ttt gcg gag tec tgc aca tgg gac 
Ser Arg Asp Val Thr Phe Asn Thr Phe Ala Glu Ser Cys Thr Trp Asp 
115 120 125 

gae get gee aag gag tgg acg gtg cga teg teg gaa gga cgt gag cag 
Asp Ala Ala Lys Glu Trp Thr Val Arg Ser Ser Glu Gly Arg Glu Gin 
130 135 140 

egg gec cgt gcg gtc ate gtc gec acc ggc ttc ggt gcg aag ccc etc 
Arg Ala Arg Ala Val lie Val Ala Thr Gly Phe Gly Ala Lys Pro Leu 
145 " 150 155 160 

tac ccg aac ate gag ggc etc gac age ttc gaa ggc gag tgc cat cac 
Tyr Pro Asn lie Glu Gly Leu Asp Ser Phe Glu Gly Glu Cys His His 
165 170 175 

acc gca cgc tgg ccg cag ggt ggc etc gac atg acg ggc aag cga gtc 
Thr Ala Arg Trp Pro Gin Gly Gly Leu Asp Met Thr Gly Lys Arg Val 
180 185 190 

gtc gtc atg ggc acc ggt get tec ggc ate cag gtc att caa gaa gec 
Val Val Met Gly Thr Gly Ala Ser Gly He Gin Val He Gin Glu Ala 
195 200 205 

gcg gcg gtt gee gaa cac etc acc gtc ttc cag cgc acc ccg aac ctt 
Ala Ala Val Ala Glu His Leu Thr Val Phe Gin Arg Thr Pro Asn Leu 
210 215 220 

gec ctg cog atg egg cag cag egg ctg teg gec gat gac aac gat cgc 
Ala Leu Pro Met Arg Gin Gin Arg Leu Ser Ala Asp Asp Asn Asp Arg 
225 230 235 240 

tae cga gag aac ate gaa gat cgt ttc caa ate cgt gac aat teg ttt 
Tyr Arg Glu Asn He Glu Asp Arg Phe Gin He Arg Asp Asn Ser Phe 
245 250 255 

gec gga ttc gac ttc tac ttc ate ccg cag aac gec gcg gac acc. cec 
Ala Gly Phe Asp Phe Tyr Phe He Pro Gin Asn Ala Ala Asp Thr Pro 
260 265 270 

gag gac gag egg acc gcg ate tac gaa aag atg tgg gac gaa ggc gga 
Glu Asp Glu Arg Thr Ala He Tyr Glu Lys Met Trp Asp Glu Gly Gly 
275 280 285 

ttc cca ctg tgg etc gga aac ttc cag gga etc etc acc gat gag gca 
Phe Pro Leu Trp Leu Gly Asn Phe Gin Gly Leu Leu Thr Asp Glu Ala 
290 295 300 

gec aac cac acc ttc tac aac ttc tgg cgt teg aag gtg cac gat cgt 
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Ala Asn His Thr Phe Tyr Asn Phe Trp Arg Ser Lys Val His Asp Arg 
3 °5 310 315 320 

gtg aag gat ccc aag acc gcc gag atg etc gca ccg gcg acc cca ccg 
Val Lys Asp Pro Lys Thr Ala Glu Met Leu Ala Pro Ala Thr Pro Pro 
325 330 335 

cac ccg ttc ggc gtc aag cgt ccc teg etc gaa cag aac tac ttc gac 
His Pro Phe Gly Val Lys Arg Pro Ser Leu Glu Gin Asn Tyr Phe Asp 
340 345 350 

gta tac aac cag gac aat gtc gat etc ate gac teg aat gcc acc ccg 
Val Tyr Asn Gin Asp Asn Val Asp Leu He Asp Ser Asn Ala Thr Pro 
355 360 365 

ate acc egg gtc ctt ccg aae ggg gtc gaa acc ccg gac gga gtc gtc 
He Thr Arg Val Leu Pro Asn Gly Val Glu Thr Pro Asp Gly Val Val 
370 375 380 

gaa tgc gat gtc etc gtg ctg gcc acc ggc ttc gac aac aac age ggc 
Glu Cys Asp Val Leu Val Leu Ala Thr Gly Phe Asp Asn Asn Ser Gly 
385 390 395 400 

ggc ate aac gcc ate gat ate aaa gcc ggc ggg cag ctg ctg cgt gac 
Gly He Asn Ala He Asp He Lys Ala Gly Gly Gin Leu Leu Arg Asp 
405 410 415 

aag tgg gcg acc ggc gtg gac acc tac atg ggg ctg teg acg cac gga 
Lys Trp Ala Thr Gly Val Asp Thr Tyr Met Gly Leu Ser Thr His Gly 
420 425 430 

ttc ccc aat etc atg ttc etc tac ggc ccg cag age cct teg ggc ttc 
Phe Pro Asn Leu Met Phe Leu Tyr Gly Pro Gin Ser Pro Ser Gly Phe 
435 440 445 

tgc aat ggg acc gac ttc ggc gga gcg cca ggc gat atg gtc gcc gac 
Cys Asn Gly Thr Asp Phe Gly Gly Ala Pro Gly Asp Met Val Ala Asp 
450 455 460 

ttc etc ate tgg etc aag gac aac ggc ate teg egg ttc gaa tec acc 
Phe Leu He Trp Leu Lys Asp Asn Gly He Ser Arg Phe Glu Ser Thr 
465 470 475 480 

gaa gag gtc gag egg gaa tgg cgc gcc cat gtc gac gac ate ttc gtc 
Glu Glu Val Glu Arg Glu Trp Arg Ala His Val Asp Asp He Phe Val 
485 490 495 

aac teg ctg ttc ccc aag gcg aag tec tgg tac tgg ggc gcc aac gtc 
Asn Ser Leu Phe Pro Lys Ala Lys Ser Trp Tyr Trp Gly Ala Asn Val 
500 505 510 

ccc ggc aag ccg gcg cag atg etc aac tat teg gag gcg tec ccg cat 
Pro Gly Lys Pro Ala Gin Met Leu Asn Tyr Ser Glu Ala Ser Pro His 
515 520 525 
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<210> 16 
<211> 529 
<212> PRT 

<213> Brevibacterium sp. HCU 
<400> 16 

Met Thr Ser Thr Met Pro Ala Pro Thr Ala Ala Gin Ala Asn Ala Asp 
15 10 15 

Glu Thr Glu Val Leu Asp Ala Leu lie Val Gly Gly Gly Phe Ser Gly 
20 25 30 

Pro Val Ser Val Asp Arg Leu Arg Glu Asp Gly Phe Lys Val Lys Val 
35 40 45 

Trp Asp Ala Ala Gly Gly Phe Gly Gly He Trp Trp Trp Asn Cys Tyr 
50 55 60 

Pro Gly Ala Arg Thr Asp Ser Thr Gly Gin He Tyr Gin Phe Gin Tyr 
65 70 75 80 

Lys Asp Leu Trp Lys Asp Phe Asp Phe Lys Glu Leu Tyr Pro Asp Phe 



Asn Gly Val Arg Glu Tyr Phe Glu Tyr Val Asp Ser Gin Leu Asp Leu 
100 105 110 



Ser Arg Asp Val Thr Phe Asn Thr Phe Ala Glu Ser Cys Thr Trp Asp 
115 120 125 



Asp Ala Ala Lys Glu Trp Thr Val Arg Ser Ser Glu Gly Arg Glu Gin 
130 135 140 



Arg Ala Arg Ala Val He Val Ala Thr Gly Phe Gly Ala Lys Pro Leu 
145 150 155 160 



Tyr Pro Asn He Glu Gly Leu Asp Ser Phe Glu Gly Glu Cys His His 
165 170 175 



Thr Ala Arg Trp Pro Gin Gly Gly Leu Asp Met Thr Gly Lys Arg Val 
180 185 190 



Val Val Met Gly Thr Gly Ala Ser Gly He Gin Val He Gin Glu Ala 
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Ala Ala Val Ala Glu His Leu Thr Val Phe Gin Arg Thr Pro Asn Leu 
210 215 220 



Ala Leu Pro Met Arg Gin Gin Arg Leu Ser Ala Asp Asp Asn Asp Arg 
225 230 235 240 



Tyr Arg Glu Asn lie Glu Asp Arg Phe Gin lie Arg Asp Asn Ser Phe 
245 250 255 



Ala Gly Phe Asp Phe Tyr Phe lie Pro Gin Asn Ala Ala Asp Thr Pro 
260 265 270 



Glu Asp Glu Arg Thr Ala lie Tyr Glu Lys Met Trp Asp Glu Gly Gly 
275 280 285 



Phe Pro Leu Trp Leu Gly Asn Phe Gin Gly Leu Leu Thr Asp Glu Ala 
290 295 300 



Ala Asn His Thr Phe Tyr Asn Phe Trp Arg Ser Lys Val His Asp Arg 
305 310 315 320 



Val Lys Asp Pro Lys Thr Ala Glu Met Leu Ala Pro Ala Thr Pro Pro 
325 330 335 



His Pro Phe Gly Val Lys Arg Pro Ser Leu Glu Gin Asn Tyr Phe Asp 
340 345 350 



Val Tyr Asn Gin Asp Asn Val Asp Leu lie Asp Ser Asn Ala Thr Pro 
355 360 365 



He Thr Arg Val Leu Pro Asn Gly Val Glu Thr Pro Asp Gly Val Val 
370 375 380 



Glu Cys Asp Val Leu Val Leu Ala Thr Gly Phe Asp Asn Asn Ser Gly 
385 390 395 400 



Gly He Asn Ala He Asp He Lys Ala Gly Gly Gin Leu Leu Arg Asp 
405 410 415 



Lys Trp Ala Thr Gly Val Asp Thr Tyr Met Gly Leu Ser Thr His Gly 
420 425 430 



Phe Pro Asn Leu Met Phe Leu Tyr Gly Pro Gin Ser Pro Ser Gly Phe 
435 440 445 
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Cys Asn Gly Thr Asp Phe Gly Gly Ala Pro Gly Asp Met Val Ala Asp 
450 455 460 

Phe Leu lie Trp Leu Lys Asp Asn Gly He Ser Arg Phe Glu Ser Thr 
465 470 475 480 

Glu Glu Val Glu Arg Glu Trp Arg Ala His Val Asp Asp He Phe Val 
485 490 495 

Asn Ser Leu Phe Pro Lys Ala Lys Ser Trp Tyr Trp Gly Ala Asn Val 
500 505 510 

Pro Gly Lys Pro Ala Gin Met Leu Asn Tyr Ser Glu Ala Ser Pro His 
515 520 525 

He 

<210> 17 
<211> 1614 
<212> DNA 

<213> Brachymonas sp. CHX 



<400> 17 

atgtcttcct cgccaagcag 


cgccattcat 


ttcgatgcca tcgttgtggg cgccggattt 


60 


ggcggcatgt atatgctgca 


caaactgcgc 


gaccagctcg gactcaaggt caaggttttc 


120 


gacacagccg gcggcatcgg 


cggcacctgg 


tattggaatc gctatcctgg agccttgtcc 


180 


gacacgcaca gtcatgtcta 


tcagtattct 


ttcgacgaag cgatgctcca agaatggaca 


240 


tggaagaaca aatacctcac 


gcagccagaa 


atactggctt atctggagta tgtagcagac 


300 


cggctcgatc tgcgcccgga 


cattcagttg 


aacacgaccg tgacatcgat gcatttcaat 


360 


gaagtccaca acatctggga 


agtgcgcacg 


gaccggggcg ggtactacac cgcgcgcttt 


420 


atcgtgacgg cactgggttt 


gttatccgcg 


atcaactggc ccaacattcc gggccgcgaa 


480 


agcttccaag gcgagatgta 


tcacacagcc 


gcctggccaa aagatgtcga actgcgcggc 


540 


aaacgcgtcg gcgtgatcgg 


caccggctcg 


acgggtgtgc agctgattac cgccatcgct 


600 


ccagaggtca aacacctgac 


ggtcttccag 


cgtacaccgc aatacagcgt gccgacggga 


660 


aatcgtcctg tctccgcgca 


agaaatcgca 


gaagtcaagc gaaacttcag caaggtatgg 


720 
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caacaagtac 


gtgaatccgc 


cgtcgcattc 


ggcttcgagg 


aaagcacagt 


gcccgcgatg 


780 


agcgtctccg 


aagccgaacg 


ccagcgcgtc 


tttcaggaag 


cctggaacca 


aggcaacggc 


840 


ttttactaca 


tgttcggcac 


attttgcgac 


atcgccaccg 


acccgcaggc 


caacgaagcc 


900 


gcagccacct 


tcatacgcaa 


caaaatcgcc 


gagatcgtca 


aagacccgga 


aaccgcccgc 


960 


aagctcacgc 


ctacggatgt 


ttacgcccga 


cgcccgcttt 


gcgacagtgg 


ctactatcgc 


1020 


acctacaacc 


gcagcaacgt 


ctcactggtg 


gatgtgaagg 


cgacaccaat 


cagtgcgatg 


1080 


acgccccggg 


gcattcgcac 


cgccgacggt 


gtcgagcacg 


agttggatat 


gttgatcctt 


1140 


gccactggct 


atgacgccgt 


cgatggcaat 


taccgccgca 


tcgacctgcg 


cggccgtggc 


1200 


ggccaaacca 


tcaatgagca 


ctggaacgac 


actcctacca 


gttatgtagg 


ggtcagcacc 


1260 


gccaacttcc 


ccaacatgtt 


catgatcctg 


ggcccgaatg 


gcccattcac 


gaacctgccg 


1320 


ccgtcgatcg 


aagcacaggt 


cgaatggatc 


accgacctgg 


ttgcccacat 


gcgccagcac 


1380 


gggctcgcga 


cggccgaacc 


aacgcgcgat 


gctgaagatg 


cctggggccg 


cacctgcgcg 


1440 


gaaatcgccg 


agcagacgct 


ttttggccag 


gttgaatcat 


ggatcttcgg 


tgccaacagc 


1500 


cccgggaaga 


aacatacttt 


gatgttctat 


ctggccggcc 


tggggaacta 


ccgcaagcag 


1560 


ctcgccgacg 


tagcgaacgc 


gcaataccaa 


ggctttgcgt 


tccaaccact 


gtaa 


1614 



<210> 18 
<211> 538 
<212> PRT 

<213> Brachymonas sp. CHX 
<400> 18 

Met Ser Ser Ser Pro Ser Ser Ala He His Phe Asp Ala He Val Val 
15 10 15 



Gly Ala Gly Phe Gly Gly Met Tyr Met Leu His Lys Leu Arg Asp Gin 
20 25 30 



Leu Gly Leu Lys Val Lys Val Phe Asp Thr Ala Gly Gly lie Gly Gly 
35 40 45 



Thr Trp Tyr Trp Asn Arg Tyr Pro Gly Ala Leu Ser Asp Thr His Ser 
50 55 60 



His Val Tyr Gin Tyr Ser Phe Asp Glu Ala Met Leu Gin Glu Trp Thr 
65 70 75 80 
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Trp Lys Asn Lys Tyr Leu Thr Gin Pro Glu lie Leu Ala Tyr Leu Glu 



Tyr Val Ala Asp Arg Leu Asp Leu Arg Pro Asp lie Gin Leu Asn Thr 
100 105 110 



Thr Val Thr Ser Met His Phe Asn Glu Val His Asn lie Trp Glu Val 
115 120 125 



Arg Thr Asp Arg Gly Gly Tyr Tyr Thr Ala Arg Phe lie Val Thr Ala 
130 * 135 140 



Leu Gly Leu Leu Ser Ala lie Asn Trp Pro Asn He Pro Gly Arg Glu 
145 150 155 160 



Ser Phe Gin Gly Glu Met Tyr His Thr Ala Ala Trp Pro Lys Asp Val 
165 170 175 



Glu Leu Arg Gly Lys Arg Val Gly Val He Gly Thr Gly Ser Thr Gly 
180 185 190 



Val Gin Leu He Thr Ala He Ala Pro Glu Val Lys His Leu Thr Val 
195 200 205 



Phe Gin Arg Thr Pro Gin Tyr Ser Val Pro Thr Gly Asn Arg Pro Val 
210 215 220 



Ser Ala Gin Glu He Ala Glu Val Lys Arg Asn Phe Ser Lys Val Trp 
225 230 235 240 



Gin Gin Val Arg Glu Ser Ala Val Ala Phe Gly Phe Glu Glu Ser Thr 
245 250 255 



Val Pro Ala Met Ser Val Ser Glu Ala Glu Arg Gin Arg Val Phe Gin 
2S0 265 270 



Glu Ala Trp Asn Gin Gly Asn Gly Phe Tyr Tyr Met Phe Gly Thr Phe 
275 280 285 



Cys Asp He Ala Thr Asp Pro Gin Ala Asn Glu Ala Ala Ala Thr Phe 
290 295 300 



He Arg Asn Lys He Ala Glu He Val Lys Asp Pro Glu Thr Ala Arg 
305 310 315 320 
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Lys Leu Thr Pro Thr Asp Val Tyr Ala Arg Arg Pro Leu Cys Asp Ser 
325 330 335 



Gly Tyr Tyr Arg Thr Tyr Asn Arg Ser Asn Val Ser Leu Val Asp Val 
340 345 350 



Lys Ala Thr Pro He Ser Ala Met Thr Pro Arg Gly He Arg Thr Ala 
355 360 365 



Asp Gly Val Glu His Glu Leu Asp Met Leu He Leu Ala Thr Gly Tyr 
370 375 380 



Asp Ala Val Asp Gly Asn Tyr Arg Arg He Asp Leu Arg Gly Arg Gly 
385 390 395 400 



Gly Gin Thr He Asn Glu His Trp Asn Asp Thr Pro Thr Ser Tyr Val 
405 410 415 



Gly Val Ser Thr Ala Asn Phe Pro Asn Met Phe Met He Leu Gly Pro 
420 425 430 



Asn Gly Pro Phe Thr Asn Leu Pro Pro Ser He Glu Ala Gin Val Glu 
435 440 445 



Trp He Thr Asp Leu Val Ala His Met Arg Gin His Gly Leu Ala Thr 
450 455 460 



Ala Glu Pro Thr Arg Asp Ala Glu Asp Ala Trp Gly Arg Thr Cys Ala 
465 470 475 480 



Glu He Ala Glu Gin Thr Leu Phe Gly Gin Val Glu Ser Trp He Phe 
485 490 495 



Gly Ala Asn Ser Pro Gly Lys Lys His Thr Leu Met Phe Tyr Leu Ala 
500 505 510 



Gly Leu Gly Asn Tyr Arg Lys Gin Leu Ala Asp Val Ala Asn Ala Gin 
515 520 525 



Tyr Gin Gly Phe Ala Phe Gin Pro Leu Glx 
530 535 



<210> 19 
<211> 1644 
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<212> DNA 

<213> Acinetobacter sp. SE19 
<220> 

<221> CDS 

<222> (1)..(1644) 

<223> 



<400> 19 

atg gag att ate atg. tea caa aaa atg gat ttt gat get ate gtg att 
Met Glu He He Met Ser' Gin Lys Met Asp Phe Asp Ala He Val He 



ggt ggt ggt ttt ggc gga ctt tat gca gtc aaa aaa tta aga gac gag 
Gly Gly Gly Phe Gly Gly Leu Tyr Ala Val Lys Lys Leu Arg Asp Glu 
20 25 30 

etc gaa ctt aag gtt cag get ttt gat aaa gee acg gat gtc gca ggt 
Leu Glu Leu Lys Val Gin Ala Phe Asp Lys Ala Thr Asp Val Ala Gly 



act tgg tac tgg aac cgt tac cca ggt gca ttg teg gat aca gaa ace 
Thr Trp Tyr Trp Asn Arg Tyr Pro Gly Ala Leu Ser Asp Thr Glu Thr 



cac etc tac tgc tat tct tgg gat aaa gaa tta eta caa teg eta gaa 
His Leu Tyr Cys Tyr Ser ,Trp Asp Lys Glu Leu Leu Gin Ser Leu Glu 



ate aag aaa aaa tat gtg caa ggc cct gat gta cgc aag tat tta cag 
He Lys Lys Lys Tyr Val Gin Gly Pro Asp Val Arg Lys Tyr Leu Gin 



caa gtg get gaa aag cat gat tta aag aag age tat caa ttc aat acc 
Gin Val Ala Glu Lys His Asp Leu Lys Lys Ser Tyr Gin Phe Asn Thr 
100 105 110 

gcg gtt caa teg get cat tac aac gaa gca gat gec ttg tgg gaa gtc 
Ala Val Gin Ser Ala His Tyr Asn Glu Ala Asp Ala Leu Trp Glu Val 
115 120 125 

acc act gaa tat ggt gat aag tac acg gcg cgt ttc etc ate act get 
Thr Thr Glu Tyr Gly Asp Lys Tyr Thr Ala Arg Phe Leu He Thr Ala 
130 ' 135 140 

tta ggc tta ttg tct gcg cct aac ttg cca aac ate aaa ggc att aat 
Leu Gly Leu Leu Ser Ala Pro Asn Leu Pro Asn He Lys Gly He Asn 
145 150 155 160 

cag ttt aaa ggt gag ctg cat cat acc age cgc tgg cca gat gac gta 
Gin Phe Lys Gly Glu Leu His His Thr Ser Arg Trp Pro Asp Asp Val 
165 170 175 
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agt ttt gaa ggt 
Ser Phe Glu Gly 
180 


aaa 
Lys 


cgt gtc 
Arg Val 


ggc 
Gly 


gtg 
Val 
185 


att ggt acg 
He Gly Thr 


ggt 
Gly 


tec 
Ser 
190 


acc 
Thr 


ggt 
Gly 


576 


gtt cag gtt att 
Val Gin Val lie 
195 


a eg 
Thr 


get gtg 
Ala Val 


gca 
Ala 
200 


cct 
Pro 


ctg get aaa 
Leu Ala Lys 


cac 
His 
205 


etc 
Leu 


act 
Thr 


gtc 
Val 


624 


ttc cag cgt tct 
Phe Gin Arg Ser 
210 


Ala 


caa tac 
Gin Tyr 
215 


age 
Ser 


gtt 
Val 


cca att ggc 
Pro He Gly 
220 


aat 
Asn 


gat 


cca 
Pro 


ctg 
Leu 


672 


tct gaa gaa gat 
Ser Glu Glu Asp 
225 


gtt 
Val 


aaa aag 
Lys Lys 
230 


ate 
He 


aaa 
Lys 


gac aat tat 
Asp Asn Tyr 
235 


gac 
Asp 


aaa 


att 
He 


tgg 
Trp 
240 


720 


gat ggt gta tgg 
Asp Gly Val Trp 


aat 
Asn 
245 


tea gcc 
Ser Ala 


ctt 
Leu 


gcc 
Ala 


ttt ggc ctg 
Phe Gly Leu 
250 


aat 
Asn 


gaa 
Glu 


age 
Ser 
255 


aca 
Thr 


768 


gtg cca gca atg 
Val Pro Ala Met 
260 


age 
Ser 


gta tea 
Val Ser 


get 
Ala 


gaa 
Glu 
265 


gaa cgc aag 
Glu Arg Lys 


gca 
Ala 


gtt 
Val 
270 


ttt 
Phe 


gaa 
Glu 


816 


aag gca tgg caa 
Lys Ala Trp Gin 
275 


aca 
Thr 


ggt ggc 
Gly Gly 


ggt 
Gly 
280 


ttc 
Phe 


cgt ttc atg 
Arg Phe Met 


ttt 
Phe 
285 


gaa 
Glu 


act 
Thr 


ttc 
Phe 


864 


ggt gat att gcc 
Gly Asp lie Ala 
290 


acc 
Thr 


aat atg 
Asn Met 
295 


gaa 
Glu 


gcc 
Ala 


aat ate gaa 
Asn He Glu 
300 


gcg 
Ala 


caa 
Gin 


aat 
Asn 


ttc 
Phe 


912 


att aag ggt aaa 
He Lys Gly Lys 
305 


att 
He 


get gaa 
Ala Glu 
310 


ate 
He 


gtc 
Val 


aaa gat cca 
Lys Asp Pro 
315 


gcc 
Ala 


att 
He 


gca 
Ala 


cag 
Gin 

320 


960 


aag ctt atg cca 
Lys Leu Met Pro 


cag 
Gin 
325 


gat ttg 
Asp Leu 


tat 
Tyr 


gca 
Ala 


aaa cgt ccg 
Lys Arg Pro 
330 


ttg 
Leu 


tgt 
Cys 


gac 
Asp 
335 


agt 
Ser 


1008 


ggt tac tac aac 
Gly Tyr Tyr Asn 
340 


acc 
Thr 


ttt aac 
Phe Asn 


cgt 
Arg 


gac 
Asp 
345 


aat gtc cgt 
Asn Val Arg 


tta 
Leu 


Glu 
350 


gat 
Asp 


gtg 
val 


1056 


aaa gcc aat ccg 
Lys Ala Asn Pro 
355 


att 
He 


gtt gaa 
Val Glu 


att 
He 
360 


Thr 


gaa aac ggt 
Glu Asn Gly 


gtg 

Val 
365 


aaa 
Lys 


etc 
Leu 


gaa 
Glu 


1104 


aat ggc gat ttc 
Asn Gly Asp Phe 
370 


gtt 
Val 


gaa tta 
Glu Leu 
375 


gac 
Asp 


atg 
Met 


ctg ata tgt 
Leu He Cys 
380 


gcc 
Ala 


aca 
Thr 


ggt 
Gly 


ttt 
Phe 


1152 


gat gcc gtc gat 
Asp Ala Val Asp 
385 


ggc 

Gly 


aac tat 
Asn Tyr 
390 


gtg 
Val 


cgc 
Arg 


atg gac att 
Met Asp He 
395 


caa 
Gin 


ggt 

Gly 


aaa 
Lys 


aac 
Asn 
400 


1200 


ggc ttg gcc atg 
Gly Leu Ala Met 


aaa 
Lys 
405 


Asp Tyr 


tgg 
Trp 


aaa 
Lys 


gaa ggt ccg 
Glu Gly Pro 
410 


teg 
Ser 


age 
Ser 


tat 
Tyr 
415 


atg 
Met 


1248 


ggt gtc acc gta 


aat 


aac tat 


cca 


aae 


atg ttc atg 


gtg 


ctt 


gga 


ccg 


1296 
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Gly Val Thr Val Asn Asn Tyr Pro Asa Met Phe Met Val Leu Gly Pro 
420 425 430 

aat ggc ccg ttt acc aac ctg ccg cca tea att gaa tea cag gtg gaa 
Asn Gly Pro Phe Thr Asn Leu Pro Pro Ser He Glu Ser Gin Val Glu 
435 440 445 

tgg ate agt gat acc att caa tac acg gtt gaa aac aat gtt gaa tec 
Trp He Ser Asp Thr He Gin Tyr Thr Val Glu Asn Asn Val Glu Ser 
450 455 460 

att gaa gcg aca aaa gaa gcg gaa gaa caa tgg act caa act tgc gec 
He Glu Ala Thr Lys Glu Ala Glu Glu Gin Trp Thr Gin Thr Cys Ala 
465 470 475 480 

aat att gcg gaa atg acc tta ttc cct aaa gcg caa tec tgg att ttt 
Asn He Ala Glu Met Thr Leu Phe Pro Lys Ala Gin Ser Trp He Phe 
485 490 495 

ggt gcg aat ate ccg ggc aag aaa aac acg gtt tac ttc tat etc ggt 
Gly Ala Asn He Pro Gly Lys Lys Asn Thr Val Tyr Phe Tyr Leu Gly 
500 505 510 

ggt tta aaa gaa tat cgc agt gcg eta gee aac tgc aaa aac cat gec 
Gly Leu Lys Glu Tyr Arg Ser Ala Leu Ala Asn Cys Lys Asn His Ala 
515 520 525 

tat gaa ggt ttt gat att caa tta caa cgt tea gat ate aag caa cct 
Tyr Glu Gly Phe Asp He Gin Leu Gin Arg Ser Asp He Lys Gin Pro 
530 535 540 

gec aat gec taa 
Ala Asn Ala 
545 



<210> 20 
<211> 547 
<212> PRT 

<213> Acinetobacter sp. SE19 
<400> 20 

Met Glu He He Met Ser Gin Lys Met Asp Phe Asp Ala He Val He 
15 10 15 

Gly Gly Gly Phe Gly Gly Leu Tyr Ala Val Lys Lys Leu Arg Asp Glu 
20 25 30 

Leu Glu Leu Lys Val Gin Ala Phe Asp Lys Ala Thr Asp Val Ala Gly 
35 40 45 

Thr Trp Tyr Trp Asn Arg Tyr Pro Gly Ala Leu Ser Asp Thr Glu Thr 
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His Leu Tyr Cys Tyr Ser Trp Asp Lys Glu Leu Leu Gin Ser Leu Glu 



lie Lys Lys Lys Tyr Val Gin Gly Pro Asp Val Arg Lys Tyr Leu Gin 



Gin Val Ala Glu Lys His Asp Leu Lys Lys Ser Tyr Gin Phe Asn Thr 
100 105 110 



Ala Val Gin Ser Ala His Tyr Asn Glu Ala Asp Ala Leu Trp Glu Val 
115 120 125 



Thr Thr Glu Tyr Gly Asp Lys Tyr Thr Ala Arg Phe Leu lie Thr Ala 
130 135 140 



Leu Gly Leu Leu Ser Ala Pro Asn Leu Pro Asn lie Lys Gly lie Asn 
145 150 155 160 



Gin Phe Lys Gly Glu Leu His His Thr Ser Arg Trp Pro Asp Asp Val 
165 170 ~ 175 



Ser Phe Glu Gly Lys Arg Val Gly Val He Gly Thr Gly Ser Thr Gly 
180 185 190 



Val Gin Val He Thr Ala Val Ala Pro Leu Ala Lys His Leu Thr Val 
195 200 205 



Phe Gin Arg Ser Ala Gin Tyr Ser Val Pro He Gly Asn Asp Pro Leu 
210 215 220 



Asp Gly Val Trp Asn Ser Ala Leu Ala Phe Gly Leu Asn Glu Ser Thr 
245 250 255 



Val Pro Ala Met Ser Val Ser Ala Glu Glu Arg Lys Ala Val Phe Glu 
260 265 270 



Lys Ala Trp Gin Thr Gly Gly Gly Phe Arg Phe Met Phe Glu Thr Phe 
275 280 285 



Gly Asp He Ala Thr Asn Met Glu Ala Asn He Glu Ala Gin Asn Phe 
290 295 300 
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He Lys Gly Lys He Ala Glu He Val Lys Asp Pro Ala He Ala Gin 
305 310 315 320 



Lys Leu Met Pro Gin Asp Leu Tyr Ala Lys Arg Pro Leu Cys Asp Ser 
325 330 335 



Gly Tyr Tyr Asn Thr Phe Asn Arg Asp Asn Val Arg Leu Glu Asp Val 
340 345 350 



Lys Ala Asn Pro He Val Glu He Thr Glu Asn Gly Val Lys Leu Glu 
355 360 365 



Asn Gly Asp Phe Val Glu Leu Asp Met Leu He Cys Ala Thr Gly Phe 
370 375 380 



Asp Ala Val Asp Gly Asn Tyr Val Arg Met Asp He Gin Gly Lys Asn 
385 390 395 400 



Gly Leu Ala Met Lys Asp Tyr Trp Lys Glu Gly Pro Ser Ser Tyr Met 
405 410 415 



Gly Val Thr Val Asn Asn Tyr Pro Asn Met Phe Met Val Leu Gly Pro 
420 425 430 



Asn Gly Pro Phe Thr Asn Leu Pro Pro Ser He Glu Ser Gin Val Glu 
435 440 445 



Trp He Ser Asp Thr He Gin Tyr Thr Val Glu Asn Asn Val Glu Ser 
450 455 460 



He Glu Ala Thr Lys Glu Ala Glu Glu Gin Trp Thr Gin Thr Cys Ala 
465 470 475 480 



Asn He Ala Glu Met Thr Leu Phe Pro Lys Ala Gin Ser Trp He Phe 
485 490 495 



Gly Ala Asn He Pro Gly Lys Lys Asn Thr Val Tyr Phe Tyr Leu Gly 
500 505 510 



Gly Leu Lys Glu Tyr Arg Ser Ala Leu Ala Asn Cys Lys Asn His Ala 
515 520 525 



Tyr Glu Gly Phe Asp He Gin Leu Gin Arg Ser Asp He Lys Gin Pro 
530 ' 535 540 
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Ala Asn Ala 
545 

<210> 21 
<211> 1320 
<212> DNA 
<213> Rhodococcus 



erythropolis AN12 



<400> 21 
atgagcacag 


agggcaagta 


cgcgctgatc 


ggagcgggtc 


cgtctggatt ggccggcgcg 


60 


cgaaacctcg 


atcgagccgg 


catagcgttc 


gacggcttcg 


agagccacga cgacgtcggt 


120 


gggctctggg 


acatcgacaa 


cccgcacagc 


accgtctacg 


agtcggcgca cctcatttcg 


180 


tcgaagggca 


ccaccgcatt 


cgcggagttc 


ccgatggcgg 


attcggttgc cgactacccg 


240 


agccacatcg 


aacttgccga 


gtatttccgc 


gactacgccg 


atacccacga tcttcgcagg 


300 


cactttgcct 


tcggcactac 


cgtcatcgac 


gttttgccgg 


tcgattcgct gtggcaggtc 


360 


accacgcgta 


gtcgcagcgg 


tgagacttca 


gtcgcgcggt 


atcgaggcgt gatcatcgcg 


420 


aacggaacgc 


tgtcgaagcc 


gaacataccg 


acgttccggg 


gegacttcac cggcacgttg 


480 


atgcacacga 


gcgagtaccg 


cagtgccgag 


atcttccgcg 


gaaagagagt gctggtcatc 


540 


ggagcgggca 


acagtggatg 


cgacatcgcc 


gtcgatgccg 


tccaccaggc cgagtgcgtc 


600 


gatttgagcg 


ttcggcgagg 


ctactacttc 


gtccccaagt 


atctgttcgg gcgaccctcg 


660 


gacacgttga 


atcagggaaa 


gccgttgccg 


ccgtggatca 


aacaacgcgt cgacaccttg 


720 


ttactcaagc 


agttcacggg 


agatccggtg 


cggttcggat 


ttccggcacc ggactacaag 


780 


atctacgaat 


cgcatccggt 


cgtgaactcg 


ttgatcctgc 


accacatcgg gcacggtgac 


840 


gtgcacgtgc 


gcgccgacgt 


cgaccggttc 


gaggggaaga 


cggtgcggtt tgtcgacgga 


900 


tcgtctgccg 


actacgacct 


cgttctctgc 


gccacggggt 


atcacctcga ctatcccttc 


960 


atcgcgcgcg 


aggacctgga 


ctggtcgggt 


gctgccccgg 


acctgttcct caacgtcgcg 


1020 


agtcgocgcc 


acgacaatct 


ctttgttctc 


ggcatggtcg 


aagcatccgg tctcgggtgg 


1080 


cagggtcgtt 


accagcaggc 


cgagttggtg 


gccaaattga 


tcaccgcacg caccgaagcc 


1140 


cccgccgcgg 


cgcgcgaatt 


ctcggcagcg 


gcggccggcc 


ctcctcccga tctgtccggg 


1200 


ggatacaagt 


acctgaagct 


gggacgaatg 


gcctactacg 


tgaacaagga cgcctaccga 


1260 


tcggcgatca 


gacggcacat 


cggactgctc 


gatgccgctc 


tgacgaaggg aggtcagtga 


1320 
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<210> 22 

<21X> 439 

<212> PRT 

<213> Rhodococcus erythropolis AN12 

<400> 22 

Met Ser Thr Glu Gly Lys Tyr Ala Leu lie Gly Ala Gly Pro Ser Gly 
1 5 10 15 

Leu Ala Gly Ala Arg Aan Leu Asp Arg Ala Gly He Ala Phe Asp Gly 
20 25 30 

Phe Glu Ser His Asp Asp Val Gly Gly Leu Trp Asp He Asp Asn Pro 
35 40 45 

His Ser Thr Val Tyr Glu Ser Ala His Leu He Ser Ser Lys Gly Thr 
50 * 55 60 

Thr Ala Phe Ala Glu Phe Pro Met Ala Asp Ser Val Ala Asp Tyr Pro 
65 70 75 80 

Ser His He Glu Leu Ala Glu Tyr Phe Arg Asp Tyr Ala Asp Thr His 



Asp Leu Arg Arg His Phe Ala Phe Gly Thr Thr Val He Asp Val Leu 
100 105 110 



Pro Val Asp Ser Leu Trp Gin Val Thr Thr Arg Ser Arg Ser Gly Glu 
115 120 125 



Thr Ser Val Ala Arg Tyr Arg Gly Val He He Ala Asn Gly Thr Leu 
130 135 140 



Ser Lys Pro Asn He Pro Thr Phe Arg Gly Asp Phe Thr Gly Thr Leu 
145 150 155 160 



Met His Thr Ser Glu Tyr Arg Ser Ala Glu He Phe Arg Gly Lys Arg 
165 170 175 



Val Leu Val He Gly Ala Gly Asn Ser Gly Cys Asp He Ala Val Asp 
180 185 190 



Ala Val His Gin Ala Glu Cys Val Asp Leu Ser Val Arg Arg Gly Tyr 
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Tyr Phe Val Pro Lys Tyr Leu Phe Gly Arg Pro Ser Asp Thr Leu Asn 
210 215 220 



Gin Gly Lys Pro Leu Pro Pro Trp lie Lys Gin Arg Val Asp Thr Leu 
225 230 235 240 



Leu Leu Lys Gin Phe Thr Gly Asp Pro Val Arg Phe Gly Phe Pro Ala 
245 250 255 



Pro Asp Tyr Lys lie Tyr Glu Ser His Pro Val Val Asn Ser Leu lie 
260 2S5 270 



Leu His His lie Gly His Gly Asp Val His Val Arg Ala Asp Val Asp 
275 280 285 



Arg Phe Glu Gly Lys Thr Val Arg Phe Val Asp Gly Ser Ser Ala Asp 

290 - 295 ... 300 



Tyr Asp Leu Val Leu Cys Ala Thr Gly Tyr His Leu Asp Tyr Pro Phe 
305 310 315 320 



He Ala Arg Glu Asp Leu Asp Trp Ser Gly Ala Ala Pro Asp Leu Phe 
325 330 . 335 



Leu Asn Val Ala Ser Arg Arg His Asp Asn Leu Phe Val Leu Gly Met 
340 345 350 



Val Glu Ala Ser Gly Leu Gly Trp Gin Gly Arg Tyr Gin Gin Ala Glu 
355 360 365 



Leu Val Ala Lys Leu He Thr Ala Arg Thr Glu Ala Pro Ala Ala Ala 
370 375 380 



Arg Glu Phe Ser Ala Ala Ala Ala Gly Pro Pro Pro Asp Leu Ser Gly 
385 390 395 400 



Gly Tyr Lys Tyr Leu Lys Leu Gly Arg Met Ala Tyr Tyr Val Asn Lys 
405 410 415 
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<210> 23 

<211> 1557 

<212> DNA 

<213> Rhodococcus erythropolis AN12 



<400> 23 
atggtcgaca 


tcgacccaac 


ctcggggcca 


tcggccggtg 


acgaggaaac tcgaactcgc 


GO 


cgaacacgag 


tcgtcgtcat 


cggagccggt 


ttcggcggca 


tcggaacggc tgtccgcttg 


12 0 


aagcagtccg 


ggatcgacga 


cttcgtcgtt 


ctggaacgtg 


ccgcggagcc cggggggacc 


180 


tggcaggtca 


atacctaccc 


cggtgcacag 


tgcgacatcc 


cgtcgattct gtactcgttc 


240 


tcgtttgcgc 


ccaatccgaa 


ctggacgcgg 


ctgtatcccc 


tgcagcccga gatctacgac 


300 


tatctccggg 


attgcgtcca 


tcgcttcgga 


ctggccggtc 


atttccactg caaccaggac 


360 


gtgacagaag 


cttcgtggga 


cgagcaagcc 


cagatctggc 


gggtacacac tgcggaaacc 


420 


gtctgggagg 


cacagttcct 


ggtcgcggcc 


accggcccgt 


tcagtgcccc cgccacaccc 


480 


gaccttcccg 


ggctcgaatc 


gtttcgtggt 


cagatgttcc 


acaccgcgga ctggaaccac 


540 


gaccacgacc 


ttcgcggtga 


gcggatagcc 


gtggtcggca 


ccggcgcctc tgcggtgcag 


600 


atcatcccca 


gactgcaacc 


gctcgcggac 


acgttgaccg 


tgttccagcg gacaccgacg 


660 


tggatcctgc 


cgcatccgga 


tcagccgatg 


accggctggc 


caagcgctct cttcgagcgg 


720 


gtgccgctca 


cccaacgact 


ggcacgcaag 


ggactcgacc 


tgcttcaaga agccctggta 


780 


cccggattcg 


tgtacaagcc 


gtcactgctc 


aaagggctgg 


ccgcactcgg ccgagcacac 


840 


cttcgccggc 


aggtgcggga 


cccggagctt 


cgcgcaaagc 


tgctccccca ctacgcattc 


900 


ggatgcaagc 


gtccgacgtt 


ctcgaacacc 


tactatcccg 


cgctggcgtc acccaatgtg 


960 


gaggtggtga 


cggacggaat 


cgtcgaggtg 


caggagcgcg 


gagttctcac cgcggacggc 


1020 


gccttccggg 


aagtcgacac 


catagtcatg 


ggaaccggct 


ttcggatggg agacaacccg 


1080 


tcgttcgaca 


ccatccgagg 


ccaggacggc 


cgcagcctcg 


cacagacgtg gaacggcagt 


1140 


gccgaggcct 


tcctcggcac 


cactatcagc 


ggttttccga 


acttcttcat gatcctcggc 


1200 


cccaattccg 


tggtctacac 


ctcacaggtc 


gtcacgatcg 


aagcccaggt cgagtacatc 


1260 


gtgagctgca 


ttcttcaaat 


ggacgagcgc 


ggcatcggca 


gcatcgacgt ccgcgcagac 


1320 


gtgcaacgcg 


agttcgtacg 


cgcgacagac 


cgccgactcg 


ccaccagcgt gtggaacgcc 


1380 


ggcgggtgca 


gtagttacta 


cctcgtcgac 


ggcggtcgca 


actacacctt ctatcccgga 


144 0 
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ttcaaccgat cattccgggc caggaccaaa cgagccgacc tcgctcacta cgcgcaggta 1500 
caacccgtct cgtccgcagc actcaccact gctcgagaaa ccgtgaggag ccgataa 1557 

<210> 24 
<211> 518 
<212> PRT 

<213> Rhodococcus erythropolia AN12 
<400> 24 

Met Val Asp lie Asp Pro Thr Ser Gly Pro Ser Ala Gly Asp Glu Glu 
15 10 15 

Thr Arg Thr Arg Arg Thr Arg Val Val Val He Gly Ala Gly Phe Gly 
20 25 30 

Gly He Gly Thr Ala Val Arg Leu Lys Gin Ser Gly He Asp Asp Phe 
35 40 45 

Val Val Leu Glu Arg Ala Ala Glu Pro Gly Gly Thr Trp Gin Val Asn 
50 55 60 

Thr Tyr Pro Gly Ala Gin Cys Asp He Pro Ser He Leu Tyr Ser Phe 
65 70 75 80 

Ser Phe Ala Pro Asn Pro Asn Trp Thr Arg Leu Tyr Pro Leu Gin Pro 



Glu He Tyr Asp Tyr Leu Arg Asp Cys Val His Arg Phe Gly Leu Ala 
100 105 110 



Gly His Phe His Cys Asn Gin Asp Val Thr Glu Ala Ser Trp Asp Glu 
115 120 125 



Gin Ala Gin He Trp Arg Val His Thr Ala Glu Thr Val Trp Glu Ala 
130 135 140 



Gin Phe Leu Val Ala Ala Thr Gly Pro Phe Ser Ala Pro Ala Thr Pro 
145 150 155 160 



Asp Leu Pro Gly Leu Glu Ser Phe Arg Gly Gin Met Phe His Thr Ala 
165 170 175 
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Asp Trp Asn His Asp His Asp Leu Arg Gly Glu Arg lie Ala Val Val 
180 185 190 



Gly Thr Gly Ala Ser Ala Val Gin lie He Pro Arg Leu Gin Pro Leu 
195 200 205 



Ala Asp Thr Leu Thr Val Phe Gin Arg Thr Pro Thr Trp He Leu Pro 
210 215 220 



His Pro Asp Gin Pro Met Thr Gly Trp Pro Ser Ala Leu Phe Glu Arg 
225 230 235 240 



Val Pro Leu Thr Gin Arg Leu Ala Arg Lys Gly Leu Asp Leu Leu Gin 
245 250 255 



Glu Ala Leu Val Pro Gly Phe Val Tyr Lys Pro Ser Leu Leu Lys Gly 
260 265 270 



Leu Ala Ala Leu Gly Arg Ala His Leu Arg Arg Gin Val Arg Asp Pro 
275 280 285 



Glu Leu Arg Ala Lys Leu Leu Pro His Tyr Ala Phe Gly Cys Lys Arg 
290 295 300 



Pro Thr Phe Ser Asn Thr Tyr Tyr Pro Ala Leu Ala Ser Pro Asn Val 
305 310 315 320 



Thr Ala Asp Gly Ala Phe Arg Glu Val Asp Thr He Val Met Gly Thr 
340 345 350 



Asp Gly Arg Ser Leu Ala Gin Thr Trp Asn Gly Ser Ala Glu Ala Phe 
370 375 380 



Leu Gly Thr Thr He Ser Gly Phe Pro Asn Phe Phe Met He Leu Gly 
385 390 395 400 



Pro Asn Ser Val Val Tyr Thr Ser Gin Val Val Thr He Glu Ala Gin 
405 410 415 



Val Glu Tyr He Val Ser Cys He Leu Gin Met Asp Glu Arg Gly He 
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Gly Ser He Asp Val Arg Ala Asp Val Gin Arg Glu Phe Val Arg Ala 
435 440 445 

Thr Asp Arg Arg Leu Ala Thr Ser Val Trp Asn Ala Gly Gly Cys Ser 
450 455 4S0 

Ser Tyr Tyr Leu Val Asp Gly Gly Arg Asn Tyr Thr Phe Tyr Pro Gly 
465 470 475 480 

Phe Asn Arg Ser Phe Arg Ala Arg Thr Lys Arg Ala Asp Leu Ala His 
485 490 495 

Tyr Ala Gin Val Gin Pro Val Ser Ser Ala Ala Leu Thr Thr Ala Arg 
500 505 510 

Glu Thr Val Arg Ser Arg 
515 

<210> 25 

<211> 1626 

<212> DNA 

<213> Rhodococcus erythropolis AN12 



<400> 25 

atgaccgatc ctgacttctc caccgcacca ctcgacgtcg tagtcatcgg cgccggcgtc 60 

gctggcatgt acgccatgca ccgacttcgc gagcaggggc tgcgtgtcca cggcttcgag 120 

gcgggctccg gagtgggcgg cacgtggtat ttcaaccgct accccggcgc acgctgcgac 180 

gtcgagagtt tcgactactc ctactcgttc tccgaagagc tgcaacagga ttgggactgg 240 

agcgagaagt acgccgcgca accggagatc ctctcgtacc tcgatcacgt ggctgatcgc 300 

ttcgacctac gcactggctt caccttcgac acacgcgttc tgagcgcaca gttcgacgag 360 

ggtactgcca cgtggcgagt acagaccgac ggcggtcacg acgtcacctc acgcttcgtc 420 

gtgtgcgcca cgggcagcct ctcgaccgca aacgttccga acattgcggg ccgtgagacc 480 

ttcggtggcg atgtgttcca caccggtttc tggccgcacg agggcgtcga cttcaccggc 540 

aaacgcgtcg gcgtgatcgg caccggatcc tcgggcatcc agtccattcc gctgatcgcc 600 

gagcaggccg atcatctcta cgtgttccag cggtccgcga attacagtgt gccggcagga 660 

aacacgcctc tcgatgacaa gcgccgcgcc gagatcaagg ccggctacgc agagcgtcga 720 
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gcgctgtcca agcgcagtgg cggtggatcg ccgttcgttt cggatcctcg cagcgccctc 780 

gaagtctcgg aggccgagag aaacgcggca tacgaggagc ggtggaagct cggcggtgtc 840 

ctgttcgcca agacattcgc agaccagacg agcaacatcg aggccaacgg gacagcggca 900 

gcgtttgccg aacgcaagat tcgctcggaa gtccaggatc aggcgatcgc cgacctgctc 960 

attccgaacg accaccccat cggaaccaag cggatagtca cggacacgaa ctactaccag 1020 

agctacaacc gtgacaacgt cagcctggta gatctcaagt ccgcaccgat cgaggcgatc 1080 

gacgaggctg gaatcaagac ggccgatgcg cactacgaac tggatgcgct ggtgtttgcc 1140 

accgggttcg acgcgatgac gggagcgctc gatcgcatcg agatccgcgg ccgcaatggc 1200 

gagacgttgc gcgagaactg gcatgcgggt ccaaggacgt atctaggcct cggagtacac 1260 

gggttcccca acctgttcat cgtcaccggg ccgggtagcc cgagtgtgct gtccaacatg 1320 

attctcgctg ccgagcagca cgtggactgg atcgcgggcg cgatcaacca cctcgattcg 1380 

gcgggcatcg acaccatcga accgagtgec gaagccgtgg acaactggct cgacgaatgc 1440 

tcacgccggg cgtcggcgac gctgtttcca tccgcgaact cctggtacat gggagccaac 1500 

attccgggaa agccgaggat attcatgcca ttcatcggag gattcggtgt ctactccgac 1560 

atctgtgcag acgtggcagc agcgggatac cgaggcttcg aactgaacag tgcggtgcac 1620 

gcatga 1626 

<210> 26 
<211> 541 
<212> PRT 

<213> Rhodococcus erythropolis AN12 



<400> 26 

Met Thr Asp Pro Asp Phe Ser Thr Ala Pro Leu Asp Val Val Val lie 
15 10 15 

Gly Ala Gly Val Ala Gly Met Tyr Ala Met His Arg Leu Arg Glu Gin 
20 25 30 

Gly Leu Arg Val His Gly Phe Glu Ala Gly Ser Gly Val Gly Gly Thr 
35 40 45 

Trp Tyr Phe Asn Arg Tyr Pro Gly Ala Arg Cys Asp Val Glu Ser Phe 
50 55 60 
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Asp Tyr Ser Tyr Ser Phe Ser Glu Glu Leu Gin Gin Asp Trp Asp Trp 



Ser Glu Lys Tyr Ala Ala Gin Pro Glu lie Leu Ser Tyr Leu Asp His 



Val Ala Asp Arg Phe Asp Leu Arg Thr Gly Phe Thr Phe Asp Thr Arg 
100 105 110 



Val Leu Ser Ala Gin Phe Asp Glu Gly Thr Ala Thr Trp Arg Val Gin 
115 12 0 125 



Thr Asp Gly Gly His Asp Val Thr Ser Arg Phe Val Val Cys Ala Thr 
130 135 140 



Phe Gly Gly Asp Val Phe His Thr Gly Phe Trp Pro His Glu Gly Val 
165 170 175 



Asp Phe Thr Gly Lys Arg Val Gly Val lie Gly Thr Gly Ser Ser Gly 
180 1B5 190 



lie Gin Ser He Pro Leu He Ala Glu Gin Ala Asp His Leu Tyr Val 
195 200 205 



Phe Gin Arg Ser Ala Asn Tyr Ser Val Pro Ala Gly Asn Thr Pro Leu 
210 215 220 



Asp Asp Lys Arg Arg Ala Glu He Lys Ala Gly Tyr Ala Glu Arg Arg 
225 230 235 240 



Ala Leu Ser Lys Arg Ser Gly Gly Gly Ser Pro Phe Val Ser Asp Pro 
245 250 255 



Arg Ser Ala Leu Glu Val Ser Glu Ala Glu Arg Asn Ala Ala Tyr Glu 
260 265 270 



Glu Arg Trp Lys Leu Gly Gly Val Leu Phe Ala Lys Thr Phe Ala Asp 
275 280 285 



Gin Thr Ser Asn He Glu Ala Asn Gly Thr Ala Ala Ala Phe Ala Glu 
290 295 300 



44 



WO 03/020890 



PCT7US02/27549 



Arg Lys lie Arg Ser Glu Val Gin Asp Gin Ala He Ala Asp Leu Leu 
305 310 315 320 



He Pro Asn Asp His Pro He Gly Thr Lys Arg He Val Thr Asp Thr 
325 330 335 



Asn Tyr Tyr Gin Ser Tyr Asn Arg Asp Asn Val Ser Leu Val Asp Leu 
340 345 350 



Lys Ser Ala Pro He Glu Ala He Asp Glu Ala Gly He Lys Thr Ala 
355 360 365 



Asp Ala His Tyr Glu Leu Asp Ala Leu Val Phe Ala Thr Gly Phe Asp 
370 375 380 



Ala Met Thr Gly Ala Leu Asp Arg He Glu He Arg Gly Arg Asn Gly 
385 390 395 400 



l Thr Leu Arg Glu Asn Trp His Ala Gly Pro Arg Thr Tyr Leu Gly 
405 410 415 



Leu Gly Val His Gly Phe Pro Asn Leu Phe He Val Thr Gly Pro Gly 
420 " 425 430 



Ser Pro Ser Val Leu Ser Asn Met He Leu Ala Ala Glu Gin His Val 
. 435 440 445 



Asp Trp He Ala Gly Ala He Asn His Leu Asp Ser Ala Gly He Asp 
450 455 460 



Thr He Glu Pro Ser Ala Glu Ala Val Asp Asn Trp Leu Asp Glu Cys 
465 470 475 480 



Ser Arg Arg Ala Ser Ala Thr Leu Phe Pro Ser Ala Asn Ser Trp Tyr 
485 490 495 



Met Gly Ala Asn He Pro Gly Lys Pro Arg He Phe Met Pro Phe He 
500 505 510 



Gly Gly Phe Gly Val Tyr Ser Asp He Cys Ala Asp Val Ala Ala Ala 
515 ' 520 525 



Gly Tyr Arg Gly Phe Glu Leu Asn Ser Ala Val His Ala 
530 535 540 
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<210> 27 

<211> 1389 

<212> DNA 

<213> Rhodococcus erythropolis AN12 



<400> 27 
atgagcccct 


cccccttgcc gagcgtctgc atcatcggcg ccgggcctac cggaatcacc 


60 


acggccaagc 


gaatgaagga attcggaata cccttcgact gctacgaagc gtccgacgag 


120 


gtcggcggaa 


actggtacta caagaacccc aacggaatgt cggcctgcta ccagagcctg 


180 


catatcgaca 


cgtcgaagtg gcgcttggca ttcgaggact tcccggtctc tgccgacctt 


240 


cccgatttcc 


cccaccattc cgaactcttc cagtacttca aggactacgt cgagcatttc 


300 


ggcctgcgtg 


agtcgatcat cttcaacacc agtgttgttg ctgcagagcg tgatgcaaac 


360 


ggactgtgga 


ccgtcacgcg ctcggacggc gaagtccgta cctacgacgt cctgatggtc 


420 


tgcaatggtc 


accactggga tcccaatatc ccggattacc cgggcgagtt cgacggcgtc 


480 


ctcatgcaca 


gccacagcta caacgacccg ttcgatccga tcgacatgcg cggcaagaaa 


540 


gtagtcgtgg 


tcggaatggg gaactccggc ttggacattg cttccgaact ggggcagaga 


600 


tacctcgccg 


acaagctcat cgtctcggcg cgccgcggcg tgtgggtgtt gccgaaatac 


660 


ctgggcggcg 


tgccgggaga caaactgatc accccgccct ggatgcctcg ggggctgcgc 


720 


ctgttcctga 


gtcgtcgatt cctcggcaag aacctgggaa ccatggaggg ctacggacta 


780 


cccaagccag 


atcaccgccc cttcgaggca catccgtcag ccagtggcga gttcttggga 


840 


cgtgccgggt 


ccggcgacat caccttcaag ccggcgatca ccaaactcga cggaaagcag 


900 


gttcatttcg 


ccgacggcac cgccgaggac gtcgacgtgg tcgtctgcgc caccggctac 


960 


aacatcagct 


tccccttctt cgacgacccg aacctgctgc cggacaaaga caaccgattc 


1020 


ccactcttca 


aacgcatgat gaagcccgga atcgacaacc tcttcttcat gggactcgct 


1080 


cagcccatgc 


cgacgctcgt aaacttcgcc gagcagcaga gcaagctcgt cgcggcctac 


1140 


ctcaccggta 


aataccagct gccgtccgcg aacgagatgc aggagatcac caaggccgac 


1200 


gaggcgtact 


tcctcgcccc ctattacaag tcaccgcgcc acaccattca gctcgagttc 


1260 


gacccgtacg 


tccgcaacat gaacaaggaa attgccaagg gcaccaagcg tgccgcggcc 


1320 


tcggggaaca 


aactacctgt tgcggcgcgt gcagcagcac acgaactcga gaaggcggat 


1380 


cgcgcatga 




1389 
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<210> 28 

<211> 462 

<212> PRT 

<213> Rhodococcus erythropolis AN12 

<400> 28 

Met Ser Pro Ser Pro Leu Pro Ser Val Cys lie lie Gly Ala Gly Pro 
1 5 10 15 

Thr Gly He Thr Thr Ala Lys Arg Met Ly S Glu Phe Gly He Pro Phe 
20 25 30 

Asp Cys Tyr Glu Ala Ser ' Asp Glu Val Gly Gly Asn Trp Tyr Tyr Lys 
35 40 45 

Asn Pro Asn Gly- Met Ser Ala Cys Tyr Gin Ser Leu His" He Asp Thr" ' 
50 55 so 

Ser Lys Trp Arg Leu Ala Phe Glu Asp Phe Pro Val Ser Ala Asp Leu 



Pro Asp Phe Pro His His Ser Glu Leu Phe Gin Tyr Phe Lys Asp Tyr 
85 , 90 95 

Val Glu His Phe Gly Leu Arg Glu Ser He He Phe Asn Thr Ser \ 



Val Ala Ala Glu Arg Asp Ala Asn Gly Leu Trp Thr Val Thr Arg Ser 
115 120 125 

Asp Gly Glu Val Arg Thr Tyr Asp Val Leu Met Val Cys Asn Gly His 



His Trp Asp Pro Asn He Pro Asp Tyr Pro Gly Glu Phe Asp Gly Val 
145 "0 155 i 6 o 



Leu Met His Ser His Ser Tyr Asn Asp Pro Phe Asp Pro He Asp Met 
16 5 170 175 

Arg Gly Lys Lys Val Val Val Val Gly Met Gly Asn Ser Gly Leu Asp 
180 185 * 190 

He Ala Ser Glu Leu Gly Gin Arg Tyr Leu Ala Asp Lys Leu He Val 
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Ser Ala Arg Arg Gly Val Trp Val Leu Pro Ly3 Tyr Leu Gly Gly Val 
210 215 220 



Pro Gly Asp Lys Leu lie Thr Pro Pro Trp Met Pro Arg Gly Leu Arg 
225 230 235 240 



Leu Phe Leu Ser Arg Arg Phe Leu Gly Lys.Asn Leu Gly Thr Met Glu 
245 250 255 



Gly Tyr Gly Leu Pro Lys Pro Asp His Arg Pro Phe Glu Ala His Pro 
260 265 270 



Ser Ala Ser Gly Glu Phe Leu Gly Arg Ala Gly Ser Gly Asp lie Thr 
275 280 285 



Phe Lys Pro Ala He Thr Lys Leu Asp Gly Lys Gin Val His Phe Ala 
290 295 300 



Asp Gly Thr Ala Glu Asp Val Asp Val Val Val Cys Ala Thr Gly Tyr 
305 310 315 320 



Asp Asn Arg Phe Pro Leu Phe Lys Arg Met Met Lys Pro Gly He Asp 
340 345 350 



Asn Leu Phe Phe Met Gly Leu Ala Gin Pro Met Pro Thr Leu Val Asn 
355 360 365 



Phe Ala Glu Gin Gin Ser Lys Leu Val Ala Ala Tyr Leu Thr Gly Lys 
370 375 360 



Tyr Gin Leu Pro Ser Ala Asn Glu Met Gin Glu He Thr Lys Ala Asp 
385 390 395 400 



Glu Ala Tyr Phe Leu Ala Pro Tyr Tyr Lys Ser Pro Arg His Thr He 
405 410 415 



Gin Leu Glu Phe Asp Pro Tyr Val Arg Asn Met Asn Lys Glu He Ala 
420 425 430 



Lys Gly Thr Lys Arg Ala Ala Ala Ser Gly Asn Lys Leu Pro Val Ala 
435 440 445 
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Ala Arg Ala Ala Ala His Glu Leu Glu Lys Ala Asp Arg Ala 
450 455 460 

<210> 29 
<211> 1572 
<212> DNA 

<213> Rhodococcus erythropolis AN12 
<400> 29 

gtgaacaacg aatctgacca cttcgaggtc gtgatcatcg gcggtggaat ttccggaatc 60 

ggcgcggcta tccacctgca gcgtctcgga atcgacaact tcgcactcct cgagaaggcc 120 

gactccctcg gtggaacctg gcgcgccaac acctatcccg ggtgcgcctg cgacgttcca 180 

tccggtctgt actcgtactc ctttgccgcc aatccggatt ggacgcgctt gttcgcggag 240 

caaccggaga tccgcgaata catcgagaac acggcgggca cgcacggagt cgacaaacac 300 

gttcgcttcg gggtcgaaat gctctccgcg cgatgggatg cgtcgcaatc actgtggaag 360 

ataacaactt ccagcggcga actgactgct cgcttcgtga tagccgctgc cggcocatgg 420 

aacgaacccc tgacaccggc gatccccgga ctggaagcgt tcgagggaga ggtgtttcat 480 

tcctcgcagt ggaatcacga ctacgacctg accggaaaac tcgtcgccgt cgtaggaacc 540 

ggagcgtcgg cagtccagtt cgttccgcgc atcgtctccc aggtctccgc ccttcacctc 600 

taccagcgaa ccgctcaatg ggttctcccc aaacccgatc actacgtacc gcggatcgaa 660 

aggtccgtca tgcgattcgt gccgggagca cagaaagcct tgcgcagcat cgaatacgga 720 

atcatggaag cgctcggatt gggattccgt aatccatgga tcctgcgaat cgtgcagaaa 780 

ctcgggtcag cccaattgcg cctacaggta cgcgatccga agctgcgcaa ggcattgact 840 

cccgactaca ccctcggttg caagcgactg ctcatgtcga actcgtacta tccggccctc 900 

ggcaaaccca acgtcagcgt ccatgccaac gccgtcgagc agatccgcgg taacaccgtg 960 

atcggcgccg acggagtgga ggcggaggtg gacgccatca tcttcggaac gggcttccac 1020 

atcctcgaca tgcccatcgc atccaaggta ttcgacggag aaggtcgatc actcgacgat 1080 

cattggcagg gaagcccgca ggcgtacttc ggctccgccg tcagtggatt ccccaacgca 1140 

ttcatcctgc tgggcccgag cctcggcacc gggcacacat cggcgttcat gatcttggaa 1200 

gcccaactga actatgtggc gcaggcaatc ggccacgccc gtcgtcacgg ctggcagacc 1260 

atcgacgtgc gagaggaagt tcaggcagcc ttcaattctc aggttcagga ggcattgggg 1320 
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accacggtct acaacgccgg tggttgcgaa agctatttct tcgacgtcaa cggccgcaac 13 80 

agtttcaact ggccgtggtc gtccggcgcc atgcgtcgac ggctacggga cttcgatccg 1440 

tatgcctaca accacacgtc gaaccctgag tcagacaaca cgccccctga acccacgcca 1500 

tccgaaccca cgccatctga acccacgcca tccgagccca ccaccagtcc ggaaccggag 1560 

tacaccgcat ga 1572 

<210> 30 

<211> 523 

<212> PRT 

<213> Rhodococcus erythropolis AN12 

<400> 30 

Val Asn Asn Glu Ser Asp His Phe Glu Val Val lie lie Gly Gly Gly 
1 5 10 15 

lie Ser Gly lie Gly Ala Ala He His Leu Gin Arg Leu Gly He Asp 
20 25 30 

Asn Phe Ala Leu Leu Glu Lys Ala Asp Ser Leu Gly Gly Thr Trp Arg 
35 40 45 

Ala Asn Thr Tyr Pro Gly Cys Ala Cys Asp Val Pro Ser Gly Leu Tyr 



Ser Tyr Ser Phe Ala Ala Asn Pro Asp Trp Thr Arg Leu Phe Ala Glu 
65 70 75 80 



Gin Pro Glu He Arg Glu Tyr He Glu Asn Thr Ala Gly Thr His Gly 



Val Asp Lys His Val Arg Phe Gly Val Glu Met Leu Ser Ala Arg Trp 
100 105 110 



Asp Ala Ser Gin Ser Leu Trp Lys He Thr Thr Ser Ser Gly Glu Leu 
115 120 125 



Thr Ala Arg Phe Val He Ala Ala Ala Gly Pro Trp Asn Glu Pro Leu 
130 135 140 



Thr Pro Ala He Pro Gly Leu Glu Ala Phe Glu Gly Glu Val Phe His 
145 150 155 " 160 
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Ser Ser Gin Trp Asn His Asp Tyr Asp Leu Thr Gly Lys Leu Val Ala 
1S5 170 175 



Val Val Gly Thr Gly Ala Ser Ala Val Gin Phe Val Pro Arg He Val 
180 185 190 



Ser Gin Val Ser Ala Leu His Leu Tyr Gin Arg Thr Ala Gin Trp Val 
195 200 205 



Leu Pro Lys Pro Asp His Tyr Val Pro Arg He Glu Arg Ser Val Met 
210 215 220 



Arg Phe Val Pro Gly Ala Gin Lys Ala Leu Arg Ser He Glu Tyr Gly 
225 230 235 240 



He Met Glu Ala Leu Gly Leu Gly Phe Arg Asn Pro Trp He Leu Arg 
245 250 255 



He Val Gin Lys Leu Gly Ser Ala Gin Leu Arg Leu Gin Val Arg Asp 
260 265 270 



Pro Lys Leu Arg Lys Ala Leu Thr Pro Asp Tyr Thr Leu Gly Cys Lys 
275 280 285 



Arg Leu Leu Met Ser Asn Ser Tyr Tyr Pro Ala Leu Gly Lys Pro Asn 
290 295 300 



Val Ser Val His Ala Asn Ala Val Glu Gin He Arg Gly Asn Thr Val 
305 310 315 320 



He Gly Ala Asp Gly Val Glu Ala Glu Val Asp Ala He He Phe Gly 
325 330 335 



Thr Gly Phe His He Leu Asp Met Pro He Ala Ser Lys Val Phe Asp 
340 345 350 



Gly Glu Gly Arg Ser Leu Asp Asp His Trp Gin- Gly Ser Pro Gin Ala 
355 360 365 



Tyr Phe Gly Ser Ala Val Ser Gly Phe Pro Asn Ala Phe He Leu Leu 
370 375 380 



Gly Pro Ser Leu Gly Thr Gly His Thr Ser Ala Phe Met He Leu Glu 
385 390 395 400 
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Ala Gin Leu Asn Tyr Val Ala Gin Ala lie Gly His Ala Arg Arg His 
405 410 415 

Gly Trp Gin Thr lie Asp Val Arg Glu Glu Val Gin Ala Ala Phe Asn 
420 425 430 

Ser Gin Val Gin Glu Ala Leu Gly Thr Thr Val Tyr Asn Ala Gly Gly 
435 440 445 

Cys Glu Ser Tyr Phe Phe Asp Val Asn Gly Arg Asn Ser Phe Asn Trp 
450 455 460 

Pro Trp Ser Ser Gly Ala Met Arg Arg Arg Leu Arg Asp Phe Asp Pro 
465 470 475 480 

Tyr Ala Tyr Asn His Thr Ser Asn Pro Glu Ser Asp Asn Thr Pro Pro 
485 490 495 

Glu Pro Thr Pro Ser Glu Pro Thr Pro Ser Glu Pro Thr Pro Ser Glu 
500 505 510 

Pro Thr Thr Ser Pro Glu Pro Glu Tyr Thr Ala 
515 520 

<210> 31 

<211> 1482 

<212> DMA 

<213> Rhodococcus erythropolis AN12 



<400> 31 

atgagcaccg aacacctcga tgtcctgatc gtcggcgccg gcttgtccgg catcggtgct 60 

gcttatcgac tccagaccga gctcccagga aagtcgtacg caatcctcga ggcccgagcg 120 

aacagcggcg gaacctggga cctcttcaag tatcccggca tccgatcgga ttccgacatg 180 

ttcacgctcg gctacccgtt tcgcccgtgg acagatgcca aagcaatcgc cgacggtgat 240 

tcgatcctgc ggtacgtgcg cgacaccgcg cgagagaacg ggatcgacaa gaagattcgg 300 

tacaaccgga aggtgacggc cgcatcatgg tcgtcagcga cctcgacctg gacagtcacg 360 

gtcacgaccg gcgacgaaga cgaaacattg acctgtaact tcctctatct ctgcagcggg 420 

tactacagct acgacggcgg atacaccccc gacttccccg gacgtgaatc gtttgccggt 480 

gaggtagtgc acccccagtt ctggcccgaa gaactcgatt actccgacaa gaaggtcgtt 54 0 
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gtgatcggaa gcggcgccac cgcagtcact ttggtcccca cgatgtcacg ggacgcaagc 600 

cacgtcacga tgctccagcg atcaccgacg tacattctgg cgcttccgtc cagcgacaaa 660 

ttatcggaca ccattcgcgc ggtactgccg aatcaactcg cgcacagcat cgctcgatgg 720 

aagagcgtcg tagtgaacct gagtttctac caactgtgcc gacgcagtcc ggcgcgtgca 780 

aagaggatgc tgaacctcgc gatcagtcgt caactcccga aagacatccc cctcgatcct 840 

cacttcacac cctcctacga tccctgggac cagcgcttgt gcgtcgtacc cgacggcgat 900 

ttgttcaaag ccctccgatc cggcaaggcc tcgatcgaga ccgatcacat cgacaccttc 960 

accgagaccg ggatccttct cgcgtcaggt cgcgaactcg aagctgacat catcgtcact 1020 

gcaacaggat tgaagatgga ggcgtgcggc gggatgtcca tcgaagtgga cggcgaactc ' 1080 

gtcaccctcg gtgatcgtta cgcctacaag ggcatgatga tcagcgacgt accgaacttc 1140 

gcgatgtgcg tcggctacac caacgcctcg tggactctgc gagcagatct cacgtcgatg 1200 

tacgtgtgcc gactgctgac ggagatggac aagcgcgact attcgaagtg cgtgccgcac 1260 

gcgaccgaag aaatggacca gcggccgatc ctggatctgg cgtcggggta cgtcatgcgt 1320 

gccgtggaac agttcccgaa gcagggatcg aagtcaccgt ggaacatgcg tcagaactac 1380 

atccttgacc gtcttcactc cacgttcggg agcatcaacg accacatgac gttctcgaag 1440 

gcaccagctc gacattcgac gccggtaccg agcaagagtt ga 1482 

<210> 32 
<211> 493 
<212> PRT 

<213> Rhodococcus erythropolis AN12 



<400> 32 

Met Ser Thr Glu His Leu Asp Val Leu lie Val Gly Ala Gly Leu Ser 
15 10 15 

Gly He Gly Ala Ala Tyr Arg Leu Gin Thr Glu Leu Pro Gly Lys Ser 
20 25 30 

Tyr Ala He Leu Glu Ala Arg Ala Asn Ser Gly Gly Thr Trp Asp Leu 
35 40 45 

Phe Lys Tyr Pro Gly He Arg Ser Asp Ser Asp Met Phe Thr Leu Gly 
50 55 60 
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Tyr Pro Phe Arg Pro Trp Thr Asp Ala Lys Ala He Ala Asp Gly Asp 
65 70 75 80 

Ser He Leu Arg Tyr Val Arg Asp Thr Ala Arg Glu Asn Gly He Asp 



Lys Lys He Arg Tyr Asn Arg Lys Val Thr Ala Ala Ser Trp Ser Ser 
100 105 110 



Ala Thr Ser Thr Trp Thr Val Thr Val Thr Thr Gly Asp Glu Asp Glu 
115 120 125 



Thr Leu Thr Cys Asn Phe Leu Tyr Leu Cys Ser Gly Tyr Tyr Ser Tyr 
130 135 140 



Asp Gly Gly Tyr Thr Pro Asp Phe Pro Gly Arg Glu Ser Phe Ala Gly 
145 150 155 160 



Glu Val Val His Pro Gin Phe Trp Pro Glu Glu Leu Asp Tyr Ser Asp 
165 170 175 



Lys Lys Val Val Val He Gly Ser Gly Ala Thr Ala Val Thr Leu Val 
180 185 190 



Pro Thr Met Ser Arg Asp Ala Ser His Val Thr Met Leu Gin Arg Ser 
195 200 205 



Pro Thr Tyr He Leu Ala Leu Pro Ser Ser Asp Lys Leu Ser Asp Thr 
210 215 220 



He Arg Ala Val Leu Pro Asn Gin Leu Ala His Ser He Ala Arg Trp 
225 230 235 240 



Lys Ser Val Val Val Asn Leu Ser Phe Tyr Gin Leu Cys Arg Arg Ser 
245 250 " 255 



Pro Ala Arg Ala Lys Arg Met Leu Asn Leu Ala He Ser Arg Gin Leu 
260 265 270 



Pro Lys Asp He Pro Leu Asp Pro His Phe Thr Pro Ser Tyr Asp Pro 
275 ' 280 285 



Trp Asp Gin Arg Leu Cys Val Val Pro Asp Gly Asp Leu Phe Lys Ala 
290 295 300 
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Leu Arg Ser Gly Lys Ala Ser lie Glu Thr Asp His lie Asp Thr Phe 
305 310 315 320 



Thr Glu Thr Gly He Leu Leu Ala Ser Gly Arg Glu Leu Glu Ala Asp 
325 330 335 



He He Val Thr Ala Thr Gly Leu Lys Met Glu Ala Cys Gly Gly Met 
340 345 350 



Ser He Glu Val Asp Gly Glu Leu Val Thr Leu Gly Asp Arg Tyr Ala 
355 360 365 



Tyr Lys Gly Met Met He Ser Asp Val Pro Asn Phe Ala Met Cys Val 
370 375 380 



Gly Tyr Thr Asn Ala Ser Trp Thr Leu Arg Ala Asp Leu Thr Ser Met 
385 390 395 400 



Tyr Val Cys Arg Leu Leu Thr Glu Met Asp Lys Arg Asp Tyr Ser Lys 
405 410 415 



Cys Val Pro His Ala Thr Glu Glu Met Asp Gin Arg Pro He Leu Asp 
420 425 430 



Leu Ala Ser Gly Tyr Val Met Arg Ala Val Glu Gin Phe Pro Lys Gin 
435 440 445 



Gly Ser Lys Ser Pro Trp Asn Met Arg Gin Asn Tyr He Leu Asp Arg 
450 455 460 



Leu His Ser Thr Phe Gly Ser He Asn Asp His Met Thr Phe Ser Lys 

465 470 475 480 



Ala Pro Ala Arg His Ser Thr Pro Val Pro Ser Lys Ser 
485 490 



<210> 33 

<211> 1620 

<212> DNA 

<213> Rhodococcus erythropolis AN12 



<400> 33 

atgacagacg aattcgacgt agtgatcgtg ggtgcaggtc tcgcaggtat gcagatgctg 60 
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cacgaggttc gcatggtcgg cctcacggcc aaagttttcg aggccggcgg aggtgcaggt 120 

ggcacctggt attggaaccg ctacccgggt gctcggtgtg acgtggagag tttggagtac 180 

tcctatcagt tctccgaggt gctccaacag gaatgggaat ggacccgccg gtacgcagat 240 

caggccgaga tcatgcgcta catcagccac gtcgtcgaaa ccttcgacct ggcccgcgac 300 

atcaggtttc atacccgggt cgaggcgatg acctacgagg agaccaccgc caggtggacg 360 

gttcagacgg ' acagtgccgg cgaggttgtg gccaaattcg tgattatggc caccgggtgt 420 

ctgtcggagc cgaacgtgcc gtacataccg ggtgtggaga cattcgcggg cgacgtgctg 480 

cacaccgggc gctggccgca ggatcccgtc gacttcacag gcaagcgggt cggcgtgatc 540 

ggaaccggat catctggcgt gcaagccatc ccactcatcg cgcggcaagc ggccgagctc SOO 

gtagtctttc agcgcactcc tgcatacacg ttgcccgctg tcgacgagcc gctcgacccg 660 

gaattgcagg cggcgatcaa ggccgattac agggggttcc gtgcgcgaaa caacgaagtg 720 

cccaccgcgg gactctcccg atttccgacg aatccgaact cggttttcct gttctcaacg 780 

aaggagcggg atgccatcct cgaacacaat tggaaccgag gcgggccgtt gatgctgcgc 840 

gccttcggcg atctgctggt ggactcagcc gctaacgagg tggtagccga gttcgtccgc 900 

aacaagatcc gccagatcgt taccgacccc gaggtcgctg cgaagctcac accgacacac 960 

gtgatcggat gcaaacgaat ctgtctcagc gacggctatt acgagaccta caaccgggtc 102 0 

aacgtgcgct tagtcgacat caaacgccac ccaatcgagg agatcacgcc tactacagcc 108 0 

cggaccggcg aggactcgca tgacctggac atgctcgtgt tcgccactgg ctacgatgcc 1140 

atcactggcg cactctcacg catcgacatc cgcggccgcg cagggttgtc attgcaggaa 12 00 

gcatggtcgg acggaccgcg cacctatctc gggctcgggg tctccggctt cccaaatctg 1260 

ttcatcatga ccggccccgg aagcccatcg gtattgacca atgttcttgt cgccatacac 1320 

caacatgcga catggatcgg cgaatgcctg aagcatatga ccgacaacga tattcggaca 1380 

atggaagcca cgcccgaagc cgagcagaac tggggggacc acgtgcgcga cctcgccgag 1440 

cagaccctgc tctcatcgtg cgggtcctgg tacctcggag caaacatccc cggtaagaga 1500 

caagtattca tgccgctggt cgggtttccg gactacgcca agaaatgcgc ggaaatcgca 1560 

tccgccggct acccgggctt cgccttccag tacgaccccg tccctgtgaa ccagagctga 1620 

<210> 34 
<211> 539 
<212> PRT 

<213> Rhodococcus erythropolis AN12 
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<400> 34 

Met Thr Asp Glu Phe Asp Val Val lie Val Gly Ala Gly Leu Ala Gly 
1 5 10 15 



Met Gin Met Leu His Glu Val Arg Met Val Gly Leu Thr Ala Lys Val 
20 25 30 

Phe Glu Ala Gly Gly Gly Ala Gly Gly Thr Trp Tyr Trp Asn Arg Tyr 
35 40 45 

Pro Gly Ala Arg Cys Asp Val Glu Ser Leu Glu Tyr Ser Tyr Gin Phe 
50 55 60 

Ser Glu Val Leu Gin Gin Glu Trp Glu Trp Thr Arg Arg Tyr Ala Asp 
65 70 75 80 

Gin Ala Glu He Met Arg Tyr He Ser His Val Val Glu Thr Phe Asp 



Leu Ala Arg Asp He Arg Phe His Thr Arg Val Glu Ala Met Thr Tyr 
100 105 110 



Glu Glu Thr Thr Ala Arg Trp Thr Val Gin Thr Asp Ser Ala Gly Glu 
115 120 125 



Val Val Ala Lys Phe Val He Met Ala Thr Gly Cys Leu Ser Glu Pro 
130 135 140 



Asn Val Pro Tyr lie Pro Gly Val Glu Thr Phe Ala Gly Asp Val Leu 
145 150 155 160 



His Thr Gly Arg Trp Pro Gin Asp Pro Val Asp Phe Thr Gly Lys Arg 
165 170 175 



Val Gly Val He Gly Thr Gly Ser Ser Gly Val Gin Ala He Pro ] 
180 185 190 



He Ala Arg Gin Ala Ala Glu Leu Val Val Phe Gin Arg Thr Pro Ala 
195 200 205 



Tyr Thr Leu Pro Ala Val Asp Glu Pro Leu Asp Pro Glu Leu Gin Ala 
210 215 220 



Ala He Lys Ala Asp Tyr Arg Gly Phe Arg Ala Arg Asn Asn Glu Val 
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Pro Thr Ala Gly Leu Ser Arg Phe Pro Thr Asn Pro Asn Ser Val Phe 
245 250 255 



Leu Phe Ser Thr Lys Glu Arg Asp Ala lie Leu Glu His Asn Trp Asn 
260 265 270 



Arg Gly Gly Pro Leu Met Leu Arg Ala Phe Gly Asp Leu Leu Val Asp 
275 280 285 



Ser Ala Ala Asn Glu Val Val Ala Glu Phe Val Arg Asn Lys He Arg 
290 295 300 



Gin He Val Thr Asp Pro Glu Val Ala Ala Lys Leu Thr Pro Thr His 
305 310 315 320 



Val He Gly Cys Lys Arg He Cys Leu Ser Asp Gly Tyr Tyr Glu Thr 
325 330 335 



Tyr Asn Arg Val Asn Val Arg Leu Val Asp He Lys Arg His Pro He 
340 345 350 



Glu Glu He Thr Pro Thr Thr Ala Arg Thr Gly Glu Asp Ser His Asp 
355 360 365 



Leu Asp Met Leu Val Phe Ala Thr Gly Tyr Asp Ala He Thr Gly Ala 
370 375 380 



Leu Ser Arg He Asp He Arg Gly Arg Ala Gly Leu Ser Leu Gin Glu 
385 390 395 400 



Ala Trp Ser Asp Gly Pro Arg Thr Tyr Leu Gly Leu Gly Val Ser Gly 
405 410 415 



Phe Pro Asn Leu Phe He Met Thr Gly Pro Gly Ser Pro Ser Val Leu 
420 425 430 



Thr Asn Val Leu Val Ala He His Gin His Ala Thr Trp He Gly Glu 
435 440 445 



Cys Leu Lys His Met Thr Asp Asn Asp He Arg Thr Met Glu Ala Thr 
450 455 460 



Pro Glu Ala Glu Gin Asn Trp Gly Asp His Val Arg Asp Leu Ala Glu 
465 470 - 475 
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Gin Thr Leu Leu Ser Ser Cys Gly Ser Trp Tyr Leu Gly Ala Asn He 
485 490 495 

Pro Gly Lys Arg Gin Val Phe Met Pro Leu Val Gly Phe Pro Asp Tyr 
500 505 510 

Ala Lys Lys Cys Ala Glu He Ala Ser Ala Gly Tyr Pro Gly Phe Ala 
515 520 525 

Phe Gin Tyr Asp Pro Val Pro Val Asn Gin Ser 
530 535 

<210> 3.5 

<211> 1950 

<212> DNA 

<213> Rhodococcus erythropolis AN12 



<400> 35 
atgactatcg 


tcactgacct 


ggaccgtgac 


cacctgcgtt 


cggcggtgtt acggggcaat 


60 


gttccgacca 


tgctcgccgt 


gttgctggag 


ctgaccgccg 


atgagcggtg ggtggcaccc 


120 


cgctatcaac 


ccacgcgcag 


tcggggcatg 


gatgacaatt 


ccacgggagg acttccggag 


180 


gaggttcagt 


ccgaaatccg 


gagcgcgttg 


atcgacgcag 


tggaacgctg gtggacgctg 


240 


gacgagccgt 


cccggcggac 


gctggacagc 


tcggaagtag 


agcgaatcct caacttcacc 


300 


tgcagcgaga 


ccgtaccgcc 


ggacttcgcg 


ccgatgatgg 


cggagatagt caatggtccg 


360 


cagatcaagc 


ctgccaccgc 


caagtgcgac 


gagcgactcc 


acgccatcgt gatcggcgcc 


420 


ggcatcgcgg 


ggatgctggc 


ctccgtcgag 


ctcagccgcg 


ctgggatccc tcacgtgatc 


480 


ctggagaaga 


acgacgacgt 


cggcggatca 


tggtgggaga 


accgctatcc gggcgccgga 


540 


gttgatacac 


cgagccacct 


ttactcgatc 


tcgtcgttcc 


ctcgtaactg gtcgacccac 


600 


ttcggcaagc 


gcgacgaggt 


tcagggatat 


ctcgaggact 


ttgcggaggc caacgacatc 


660 


cggcgcaatg 


tccgcttccg 


tcatgaggtg 


acgcgcgccg 


agttcgagga gtcgaaacag 


720 


agttggcgtg 


tgtccgtcca 


gcgaccaggt 


gaggcgtcgg 


agaccctcga ggctcccatc 


780 


ctgatcagcg 


cggtcggtct 


gctcaatcgt 


ccgaagatcc 


cgcatctacc gggaatcgag 


840 


accttccgtg 


gtcgcctctt 


ccactccgcc 


gagtggccga 


gcgagctcga cgatcccgag 


900 


tcgctccgcg 


gaaagcgagt 


gggcatcgtc 


ggtaccggag 


ccagtgctat gcagatcggc 
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ccggccatcg cggatcgtgt cggatcgctg acgatcttcc agcgctcacc acagtggatc 1020 

gcaccgaacg acgactactt cacgaccatc gacgacggcg tccactggct gatggacaac 1080 

atccccggct atcgcgagtg gtaccgggcg cgtctgtcgt ggatcttcaa cgacaaggtg 1140 

tactcgtccc tccaggtcga ccccgactgg ccagagccga gcgcctcgat caatgcgacc 1200 

aaccatggtc atcgcaagtt ctacgaacgc tatctccgcg atcagctggg tgatcgaaca 1260 

gatctgatcg aggcatctct tccggactat ccgccctttg gtaagcgaat gctgctggac 1320 

aatggctggt tcacgatgct tcgtaagccc gacgtcacac tggtgcccca cggagtcgac 1380 

gccctgacac cttctggact cgtcgacacg aacggcgtcg agcaccagct ggacgtcatt 1440 

gtcatggcga cgggtttcca cagtgtgcgc gttctttacc cgatggacat cgtcggtcga 1500 

tccggccggt ccaccggaga aatctggggc gagcacgacg cgcgcgccta cctggggatc 1560 

acagttcctg acttccccaa tttcttcgtc atgaccggac cgaacaccgg cctgggacat 1620 

ggggggagct tcatcacgat cctggaatgt caggtccgct acatcatgga tgccttgaag 1680 

ttgatgcaat cggaaaacct cggcgcgatg gagtgccggg ccgaggtcaa cgatcgatac 1740 

aacgaggccg tcgaccgaca gcacgcacag atggtctgga cccatccggc aatggagaac 1800 

tggtaccgaa acccggacgg tcgcgtcgtg tcggtccttc cgtggcggat caacgactac 1860 

tgggccatga cctaccgagt cgacccgtca gattttcgta ccgagccggc acgctccgag 1920 

tcggtcccga ctccgaccgc gcgagggtga 1950 

<210> 36 
<211> 649 
<212> PRT 

<213> Rhodococcus erythropolis AN12 



<400> 36 

Met Thr He Val Thr Asp Leu Asp Arg Asp His Leu Arg Ser Ala Val 
15 10 15 

Leu Arg Gly Asn Val Pro Thr Met Leu Ala Val Leu Leu Glu Leu Thr 
20 25 30 

Ala Asp Glu Arg Trp Val Ala Pro Arg Tyr Gin Pro Thr Arg Ser Arg 
35 40 45 

Gly Met Asp Asp Asn Ser Thr Gly Gly Leu Pro Glu Glu Val Gin Ser 
50 55 SO 
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Glu lie Arg Ser Ala Leu lie Asp Ala Val Glu Arg Trp Trp Thr Leu 



Asp Glu Pro Ser Arg Arg Thr Leu Asp Ser Ser Glu Val Glu Arg He 



Leu Asn Phe Thr Cys Ser Glu Thr Val Pro Pro Asp Phe Ala Pro Met 
100 105 110 



Met Ala Glu He Val Asn Gly Pro Gin He Lys Pro Ala Thr Ala Lys 
115 120 125 



Cys Asp Glu Arg Leu His Ala He Val He Gly Ala Gly He Ala Gly 
130 135 140 



Met Leu Ala Ser Val Glu Leu Ser Arg Ala Gly He Pro His Val He 
145 150 155 160 



Leu Glu Lys Asn Asp Asp Val Gly Gly Ser Trp Trp Glu Asn Arg Tyr 
165 170 175 



Pro Gly Ala Gly Val Asp Thr Pro Ser His Leu Tyr Ser He Ser Ser 
180 185 190 



Phe Pro Arg Asn Trp Ser Thr His Phe Gly Lys Arg Asp Glu Val Gin 
195 200 205 



Gly Tyr Leu Glu Asp Phe Ala Glu Ala Asn Asp He Arg Arg Asn Val 
210 215 220 



Arg Phe Arg His Glu Val Thr Arg Ala Glu Phe Glu Glu Ser Lys Gin 
225 230 235 240 



Ser Trp Arg Val Ser Val Gin Arg Pro Gly Glu Ala Ser Glu Thr Leu 
245 250 255 



Glu Ala Pro He Leu He Ser Ala Val Gly Leu Leu Asn Arg Pro Lys 
260 265 270 



He Pro His Leu Pro Gly He Glu Thr Phe Arg Gly Arg Leu Phe His 
275 280 285 



Ser Ala Glu Trp Pro Ser Glu Leu Asp Asp Pro Glu Ser Leu Arg Gly 
290 295 300 
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Lys Arg Val Gly lie Val Gly Thr Gly Ala Ser Ala Met Gin He Gly 
305 310 315 320 



Pro Ala He Ala Asp Arg Val Gly Ser Leu Thr He Phe Gin Arg Ser 
325 330 335 



Pro Gin Trp He Ala Pro Asn Asp Asp Tyx Phe Thr Thr He Asp Asp 
340 345 350 



Gly Val His Trp Leu Met Asp Asn He Pro Gly Tyr Arg Glu Trp Tyr 
355 360 365 



Arg Ala Arg Leu Ser Trp He Phe Asn Asp Lys Val Tyr Ser Ser Leu 
370 375 380 



Gin Val Asp Pro Asp Trp Pro Glu Pro Ser Ala Ser He Asn Ala Thr 
385 390 395 400 



Asn His Gly His Arg Lys Phe Tyr Glu Arg Tyr Leu Arg Asp Gin Leu 
405 410 415 



Gly Asp Arg Thr Asp Leu He Glu Ala Ser Leu Pro Asp Tyr Pro Pro 
420 425 430 



Phe Gly Lys Arg Met Leu Leu Asp Asn Gly Trp Phe Thr Met Leu Arg 
435 440 445 



Lys Pro Asp Val Thr Leu Val Pro His Gly Val Asp Ala Leu Thr Pro 
450 455 460 



Ser Gly Leu Val Asp Thr Asn Gly Val Glu His Gin Leu Asp Val He 
465 470 475 480 



Val Met Ala Thr Gly Phe His Ser Val Arg Val Leu Tyr Pro Met Asp 
485 490 495 



He Val Gly Arg Ser Gly Arg Ser Thr Gly Glu He Trp Gly Glu His 
500 505 510 



Asp Ala Arg Ala Tyr Leu Gly He Thr Val Pro Asp Phe Pro Asn Phe 
515 520 525 



Phe Val Met Thr Gly Pro Asn Thr Gly Leu Gly His Gly Gly Ser Phe 
530 535 540 
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lie Thr lie Leu Glu Cys Gin Val Arg Tyr He Met Asp Ala Leu Lys 
545 550 555 560 

Leu Met Gin Ser Glu Asn Leu Gly Ala Met Glu Cys Arg Ala Glu Val 
565 570 ~ 575 

Asn Asp Arg Tyr Asn Glu Ala Val Asp Arg Gin His Ala Gin Met Val 
580 585 590 

Trp Thr His Pro Ala Met Glu Asn Trp Tyr Arg Asn Pro Asp Gly Arg 
595 600 605 

Val Val Ser Val Leu Pro Trp Arg He Asn Asp Tyr Trp Ala Met Thr 
610 615 620 

Tyr Arg Val Asp Pro Ser Asp Phe Arg Thr Glu Pro Ala Arg Ser Glu 
625 630 635 640 

Ser Val Pro Thr Pro Thr Ala Arg Gly 
645 

<210> 37 
<211> 1485 
<212> DNA 

<213> Rhodococcus erythropolis AN12 



<400> 37 

gtgaagcttc ccgaacatgt cgaaacattg atcgtcggtg ccggattcgc cggtatgggc 60 

ttggcggcca gaatgcttcg tgacaaccga acggcggacg tcgtgttgat cgagcgcgga 120 

gctgatatcg gtggcacctg gcgagacaac acctacccag gttgtgcctg tgacgtgccg 180 

acggcgctgt actcgtattc ttttgcgccg agcgctgatt ggagtcatac ctttgctcgt 240 

cagcccgaga tctacgacta tctgaagaaa gtggccgcag acaccggcat cggggatcgc 300 

gtaatcctga actgcgaact cgaagccgct gtgtgggacg aggatgcggc gctgtggcgg 360 

gtccggacat ccctggggtc gttgacagtc aaagcgctgg tcgctgcgac cggggcgttg 420 

tcgacaccca agatcccgga ttttcccggt ctcgaccaat tctccggtac cactttccat 4 80 

tcggcgacgt ggaaccacga acacgaactg cgtggtgagc gcgtagccgt gatcggaacg 540 

ggagcgtcgg cggttcagtt cgttcccgaa attgccgacc ctgctgccca tgtcaccgtg 600 

ttccagagaa ctccggcctg ggtgattccg cgaatggatc gcaccctgcc tgcggcgcag 660 
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aaggccgtct 


actcgcggat 


tcccgctacg 


cagaaagttg ttcgcggagc ggtttacggt 


720 


tttcgcgagt 


tgctcggtgc 


cgcgatgtca 


catgcgacgt gggtcctgcc ggccttcgag 


780 


gcggccgcgc 


gcctccatct 


gcgcagacag 


gtgaaagatc cggagttgcg ccggaaactg 


840 


actcccgatt 


tcacgatcgg 


ttgcaagcgc 


atgcttctgt ccaacgactg gttgcgcacc 


900 


ctcgaccgcg 


cggacgtgag 


cctggtcgac 


agcgggctcg tctcggtcac cgagggcggg 


960 


gtggtcgacg 


ggcacggagt 


cgagcacaag 


gtcgacacca tcatcttcgc cacggggttc 


1020 


acgccgacgg 


aaccgcctgt 


ggcgcatctg 


atcaccggaa aacgtggcga aacgctggcc 


1080 


gcgcattgga 


acggtagccc 


caatgcctac 


aagggcactg cggtcagcgg gttcccgaat 


1140 


ctgttcctca 


tgtacggtcc 


gaacaccaac 


ctcggacaca gttcgatcgt gtacatgctc 


1200 


gagtcccagg 


ccgagtacgt 


caacgacgcg 


ttgaacacca tgaaacgtga gcgactggac 


1260 


gctcttgatg 


tcaacgagtc 


ggtacaggtg 


cactacaaca agggaattca gcacgagttg 


1320 


cagcacacgg 


tgtggaacaa 


gggcggatgc 


tcgagttggt acatcgatcc ggaggggcgc 


13B0 


aactcggtgc 


agtggccgac 


gttcacattc 


aaattccgtt cgctgctgga gcatttcgat 


1440 


cgtgagaact 


actccgctcg 


caagatcgaa 


agcgtccagg catga 


1485 



<210> 38 
<211> 494 
<212> PRT 

<213> Rhodococcus erythropolis AN12 
<400> 38 

Val Lys Leu Pro Glu His Val Glu Thr Leu He Val Gly Ala Gly Phe 
15 10 15 



Ala Gly Met Gly Leu Ala Ala Arg Met Leu Arg Asp Asn Arg Thr Ala 
20 25 30 



Asp Val Val Leu He Glu Arg Gly Ala Asp He Gly Gly Thr Trp Arg 
35 40 45 



Asp Asn Thr Tyr Pro Gly Cys Ala Cys Asp Val Pro Thr Ala Leu Tyr 
50 55 60 



Ser Tyr Ser Phe Ala Pro Ser Ala Asp Trp Ser His Thr Phe Ala Arg 
65 70 75 80 
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Gin Pro Glu lie Tyr Asp Tyr Leu Lys Lys Val Ala Ala Asp Thr Gly 



He Gly Asp Arg Val He Leu Asn Cys Glu Leu Glu Ala Ala Val Trp 
100 105 110 



Asp Glu Asp Ala Ala Leu Trp Arg Val Arg Thr Ser Leu Gly Ser Leu 
115 120 125 



Thr Val Lys Ala Leu Val Ala Ala Thr Gly Ala Leu Ser Thr Pro Lys 
130 135 140 



He Pro Asp Phe Pro Gly Leu Asp Gin Phe Ser Gly Thr Thr Phe His 
145 150 155 160 



Ser Ala Thr Trp Asn His Glu His Glu Leu Arg Gly Glu Arg Val Ala 
165 170 175 



Val He Gly Thr Gly Ala Ser Ala Val Gin Phe Val Pro Glu He Ala 
180 185 190 



Asp Pro Ala Ala His Val Thr Val Phe Gin Arg Thr Pro Ala Trp Val 
195 200 205 



He Pro Arg Met Asp Arg Thr Leu Pro Ala Ala Gin Lys Ala Val Tyr 
210 215 220 



Ser Arg He Pro Ala Thr Gin Lys Val Val Arg Gly Ala Val Tyr Gly 
225 230 235 240 



Phe Arg Glu Leu Leu Gly Ala Ala Met Ser HiB Ala Thr Trp Val Leu 
245 250 255 



Pro Ala Phe Glu Ala Ala Ala Arg Leu His Leu Arg Arg Gin Val Lys 
260 265 270 



Asp Pro Glu Leu Arg Arg Lys Leu Thr Pro Asp Phe Thr He Gly Cys 
275 280 285 



Asp Val Ser Leu Val Asp Ser Gly Leu Val Ser Val Thr Glu Gly Gly 
305 310 315 320 



Val Val Asp Gly His Gly Val Glu His Lys Val Asp Thr He He Phe 
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Ala Thr Gly Phe Thr Pro Thr Glu Pro Pro Val Ala His Leu lie Thr 
340 345 350 



Gly Lys Arg Gly Glu Thr Leu Ala Ala His Trp Asn Gly Ser Pro Asn 
355 360 365 



Ala Tyr Lys Gly Thr Ala Val Ser Gly Phe Pro Asn Leu Phe Leu Met 
370 375 380 



Tyr Gly Pro Asn Thr Asn Leu Gly His Ser Ser He Val Tyr Met Leu 
385 390 395 400 



Glu Ser Gin Ala Glu Tyr Val Asn Asp Ala Leu Asn Thr Met Lys Arg 
405 410 415 



Glu Arg Leu Asp Ala Leu Asp Val Asn Glu Ser Val Gin Val His Tyr 
420 425 430 



Asn Lys Gly He Gin His Glu Leu Gin His Thr Val Trp Asn Lys Gly 
435 440 445 

Gly Cys Ser Ser Trp Tyr He Asp Pro Glu Gly Arg Asn Ser Val Gin 
450 455 460 

Trp Pro Thr Phe Thr Phe Lys Phe Arg Ser Leu Leu Glu His Phe Asp 
465 470 475 480 

Arg Glu Asn Tyr Ser Ala Arg Lys He Glu Ser Val Gin Ala 
485 490 

<210> 39 
<211> 1500 
<212> DNA 

<213> Rhodococcus erythropolis AN12 
<400> 39 

atgacacagc atgtcgacgt actgatcatc ggcgctggct tgtccggaat cggcgcggct 60 
tgccacctca ttcgtgagca gaccggaagc acttacgcga tcctcgagcg ccgcgagaac 120 
atcggtggca cctgggacct gttcaagtac ccgggcatcc gttcggactc cgacatgctc 180 
accttcggat tcggtttccg tccttggatc ggcaccaaag tgctcgcaga cggcgccagt 240 
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atccgtgact acgtcgagga aaccgccaag gaatacggcg tcaccgacca catcaacttc 300 

ggccgcaagg tcgtggctat ggacttcgac cgtaccgccg cgcagtggtc cgtgaccgtc 350 

ctggtcgagg cgacagggga gaccgagacg tggaccgcga acgtcctcgt cggcgcctgt 420 

ggttactaca actacgacaa gggttaccgc cccgccttcc ccggtgagga cgacttccgc 480 

ggtcagatcg tgcacccgca gcactggccg gaggatctcg attacaccgg aaagaaggta 540 

gtggtcatcg gttccggcgc caccgcgatc acgctgatcc cgtcgatggc ccccaccgcc 600 

ggtcacgtca ccatgctgca gcgctcgccc acgtggatcc aggcgcttcc gtccgaggac 660 

cctgttgcca agggtctcaa gctcgcacgc gttcccgacc agattgctta caagattggt 720 

cgagcccgca atatcgcact gcaacgcgcc agctttcagc tttctcgcac caacccgaag 780 

ctggccaaga agctgttcct cgcccagatc cgcctgcagc tcggcaagaa cgtggacctg 840 

cgtcacttca ctcccagcta caacccgtgg gatcagcgcc tgtgcgtggt tcccaacggg 900 

gacctgttca aggtgctcaa gagcggcaag gccgacatcg tcaccgaccg tatcgccacg 960 

ttcaccgaga agggcatcgt gaccgagtcg ggccgcgaaa tcgaggccga cgtcatcgtc 1020 

acggcgaccg gcttgaacgt acagattctg ggcggcgcaa ccatgagcat cgacggcgag 1080 

ccggtcaagc tcaacgagac tgtggcctac aagagcgtgc tctactccga catcccgaac 1140 

ttcctgatga tcctcggcta caccaacgcg tcgtggacgc tcaaggctga cctggccgcg 1200 

tcctatctgt gtcgcgtgct caagatcatg cgcgatcgca gctacacgac tttcgaggtt 1260 

cacgccgaac ccgaggactt cgccgaagaa tctctcatgg gcggagccct gacctcgggc 1320 

tacatccagc gcggcgacgg agaaatgccg cgtcagggtg cccgcggcgc gtggaaagtg 13 80 

gtcaacaatt actaccgcga ccgcaagctg atgcacgacg ccgagatcga agacggtgtg 144 0 

ctgcagttca gcaaggtcga tattgctgtc gtgcctgata gcaaggtcgc cagcgcatag 1500 

<210> 40 
<211> 499 
<212> PRT 

<213> Rhodococcus erythropolis AN12 
<400> 40 

Met Thr Gin, His Val Asp Val Leu He lie Gly Ala Gly Leu Ser Gly 
15 10 15 

He Gly Ala Ala Cys His Leu He Arg Glu Gin Thr Gly Ser Thr Tyr 
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Ala lie Leu Glu Arg Arg Glu Asn He Gly Gly Thr Trp Asp Leu Phe 
35 40 45 

Lys Tyr Pro Gly He Arg Ser Asp Ser Asp Met Leu Thr Phe Gly Phe 
50 55 60 

Gly Phe Arg Pro Trp He Gly Thr Lys Val Leu Ala Asp Gly Ala Ser 
65 70 75 80 

He Arg Asp Tyr Val Glu Glu Thr Ala Lys Glu Tyr Gly Val Thr Asp 



His He Asn Phe Gly Arg Lys Val Val Ala Met Asp Phe Asp Arg Thr 
100 105 110 



Ala Ala Gin Trp Ser Val Thr Val Leu Val Glu Ala Thr Gly Glu Thr 
115 120 125 



Glu Thr Trp Thr Ala Asn Val Leu Val Gly Ala Cys Gly Tyr Tyr Asn 
130 135 140 



Tyr Asp Lys Gly Tyr Arg Pro Ala Phe Pro Gly Glu Asp Asp Phe Arg 
145 150 155 160 



Gly Gin He Val His Pro Gin His* Trp Pro Glu Asp Leu Asp Tyr Thr 
165 170 175 



Gly Lys Lys Val Val Val He Gly Ser Gly Ala Thr Ala He Thr Leu 
180 185 190 



He Pro Ser Met Ala Pro Thr Ala Gly His Val Thr Met Leu Gin Arg 
195 200 205 



Ser Pro Thr Trp He Gin Ala Leu Pro Ser Glu Asp Pro Val Ala Lys 
210 215 220 



Gly Leu Lys Leu Ala Arg Val Pro Asp Gin He Ala Tyr Lys He Gly 
225 230 235 240 



Arg Ala Arg Asn He Ala Leu Gin Arg Ala Ser Phe Gin Leu Ser Arg 
245 250 255 



Thr Asn Pro Lys Leu Ala Lys Lys Leu Phe Leu Ala Gin He Arg Leu 
260 265 270 
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Gin Leu Gly Lys Asn Val Asp Leu Arg His Phe Thr Pro Ser Tyr Asn 
275 280 285 



Pro Trp Asp Gin. Arg Leu Cys Val Val Pro Asn Gly Asp Leu Phe Lys 
290 295 300 



Val Leu Lys Ser Gly Lys Ala Asp He Val Thr Asp Arg He Ala Thr 
305 310 315 320 



Phe Thr Glu Lys Gly He Val Thr Glu Ser Gly Arg Glu He Glu Ala 
325 330 335 



Asp Val He Val Thr Ala Thr Gly Leu Asn Val Gin He Leu Gly Gly 
340 345 350 



Ala Thr Met Ser He Asp Gly Glu Pro Val Lys Leu Asn Glu Thr Val 
355 360 365 



Ala Tyr Lys Ser Val Leu Tyr Ser Asp He Pro Asn Phe Leu Met He 
370 375 380 



Leu Gly Tyr Thr Asn Ala Ser Trp Thr Leu Lys Ala Asp Leu Ala Ala 
385 390 395 400 



: Tyr Leu Cys Arg Val Leu Lys He Met Arg Asp Arg Ser Tyr Thr 
405 410 " 415 



Thr Phe Glu Val His Ala Glu Pro Glu Asp Phe Ala Glu Glu Ser Leu 
420 425 430 



Met Gly Gly Ala Leu Thr Ser Gly Tyr He Gin Arg Gly Asp Gly Glu 
435 440 445 



Met Pro Arg Gin Gly Ala Arg Gly Ala Trp Lys Val Val Asn Asn Tyr 
450 455 460 



Tyr Arg Asp Arg Lys Leu Met His Asp Ala Glu He Glu Asp Gly Val 
465 470 475 480 



Leu Gin Phe Ser Lys Val Asp He Ala Val Val Pro Asp Ser Lys Val 
485 490 495 
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<210> 41 
<211> 1482 
<212> DNA 

<213> Rhodococcus erythropolis AN12 
<400> 41 

atgtcatcac gggtcaacga cggccacatc gcgatcatcg gaaccgggtt ttccgggctg 60 

tgcatggcga tcgaactgaa gaagaagggc atcgacgact tcgtcctgta cgaacgcgcc 120 

gacgatgtcg gcggaacctg gcgcgacaac acatacccag gggcagcctg cgatgtgccc 180 

agcgtgttgt attcctactc cttcgctcag aacccgaact ggacccgtat cttcccgcca 240 

tggtcggaac tgctcgacta tctcagatct gttgctgcgc agtatgattt gctgccgcac 3 00 

atccgcttcg gtgtcgaggt ctccgaaatg cggttcgacg aggaccggct ccggtggaac 360 

atccagttcg catccggcga atcagtgacg gcggccgttg tcgtcaacgg ctcagggggc 420 

ttgagtaatc cgtacatccc gcagctaccc ggactggaat cattcgaggg tgccgcattc 480 

cactccgcca agtggcgaca tgacctcgac atgtcgggaa ggcgtgtcgc ggtgataggt 540 

tccggcgcca gtgcgatcca gttcgtcccc gaaatcgccc cgcacaccga gacccttcat 600 

gtgtttcagc gatcacccaa ctgggtcatg ccacgtggtg atgccgcgct gtcgcccgcc 660 

acccgcgaaa gattctcacg gcgtccttat cgtcaacggt ggctgcgatg gcggacctac 720 

tgggcattcg aaaagctcgc cagcgccttc ctcggaaatc gcaaactcgt cgaacagtac 780 

cgatcccagg cgctcgccaa tcttcaacag caagtgccgg attcggactt gaggcagaag 840 

gtcaccccag attacgatcc tggctgtaaa cgtcgcttga tatccgacga ctggtacccc 900 

gcgctgcaac gggaaaatgt gcacttgaac acctcggggg tttccgagat ccgcccgcat 960 

tcgatcattg actcagaggg agcggaacac gaagtcgaca ccctgatctt cgcgaccgga 1020 

ttccaggcaa ccagcttcct ggcaccgatg aaagtattcg gccgcgaagg agtcgaactc 1080 

tccgacagtt ggcgcgaggg cgccgcaaca aagctcgggc ttgcatccgc cgcgttcccg 1140 

aacctgtggt tcctcaacgg cccgaatacc ggtctcggtc acaactcgat catcttcatg 1200 

atcgaagcac aagccagata catcgcttcg gcagtgcagt acatgcgccg aaaaagtatc 1260 

actgccctcg aactcgatcg caccgtccag acaggcagct acgccgccac ccaagaacgc 1320 

atgcgccgaa ctgtatgggc atcgggtggc tgcgacagct ggtatcaatc cgctgacggt 1380 

cgaatcgaca ccctgtggcc ggccagcaca atcgaatact ggttgcgcac caggctattc 1440 

cgcaagtccg acttccatgc actgacgaca ggcaaaggat ga 1482 
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<210> 42 
<211> 493 
<212> PRT 

<213> Rhodococcus erythropolis AN12 
<400> 42 

Met Ser Ser Arg Val Asn Asp Gly His lie Ala He He Gly Thr Gly 
15 10 15 

Phe Ser Gly Leu Cys Met Ala He Glu Leu Lys Lys Lys Gly He Asp 



Asp Phe Val Leu Tyr Glu Arg Ala Asp Asp Val Gly Gly Thr Trp Arg 
35 40 45 



Asp Asn Thr Tyr Pro Gly Ala Ala Cys Asp Val Pro Ser Val Leu Tyr 
50 55 60 



Ser Tyr Ser Phe Ala Gin Asn Pro Asn Trp Thr Arg He Phe Pro Pro 



Trp Ser Glu Leu Leu Asp Tyr Leu Arg Ser Val Ala Ala Gin Tyr Asp 



Leu Leu Pro His He Arg Phe Gly Val Glu Val Ser Glu Met Arg Phe 
100 105 110 



Asp Glu Asp Arg Leu Arg Trp Asn He Gin Phe Ala Ser Gly Glu Ser 
115 120 125 



Val Thr Ala Ala Val Val Val Asn Gly Ser Gly Gly Leu Ser Asn Pro 
130 135 140 



Tyr He Pro Gin Leu Pro Gly Leu Glu Ser Phe Glu Gly Ala Ala Phe 
145 150 155 160 



His Ser Ala Lys Trp Arg His Asp Leu Asp Met Ser Gly Arg Arg Val 
165 170 175 



Ala Val He Gly Ser Gly Ala Ser Ala He Gin Phe Val Pro Glu He 
180 185 190 
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Ala Pro His Thr Glu Thr Leu His Val Phe Gin Arg Ser Pro Asn Trp 
195 200 205 



Val Met Pro Arg Gly Asp Ala Ala Leu Ser Pro Ala Thr Arg Glu Arg 
210 215 220 



Phe Ser Arg Arg Pro Tyr Arg Gin Arg Trp Leu Arg Trp Arg Thr Tyr 
225 230 235 240 



Trp Ala Phe Glu Lys Leu Ala Ser Ala Phe Leu Gly Asn Arg Lys Leu 
245 250 255 



Val Glu Gin Tyr Arg Ser Gin Ala Leu Ala Asn Leu Gin Gin Gin Val 
260 265 270 



Pro Asp Ser Asp Leu Arg Gin Lys Val Thr Pro Asp Tyr Asp Pro Gly 
275 280 285 



Cys Lys Arg Arg Leu lie Ser Asp Asp Trp Tyr Pro Ala Leu Gin Arg 
290 295 300 



Glu Asn Val His Leu Asn Thr Ser Gly Val Ser Glu lie Arg Pro His 
305 310 • 315 320 



Ser lie lie Asp Ser Glu Gly Ala Glu His Glu Val Asp Thr Leu lie 
325 330 335 



Phe Ala Thr Gly Phe Gin Ala Thr Ser Phe Leu Ala Pro Met Lys Val 
340 345 350 



Phe Gly Arg Glu Gly Val Glu Leu Ser Asp Ser Trp Arg Glu Gly Ala 
355 360 365 



Ala Thr Lys Leu Gly Leu Ala Ser Ala Ala Phe Pro Asn Leu Trp Phe 
370 375 380 



Leu Asn Gly Pro Asn Thr Gly Leu Gly His Asn Ser lie lie Phe Met 
385 390 395 400 



He Glu Ala Gin Ala Arg Tyr He Ala Ser Ala Val Gin Tyr Met Arg 
405 410 415 



Arg Lys Ser He Thr Ala Leu Glu Leu Asp Arg Thr Val Gin Thr Gly 
420 425 430 
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Ser Tyr Ala Ala Thr Gin Glu Arg Met Arg Arg Thr Val Trp Ala Ser 
435 440 445 

Gly Gly Cys Asp Ser Trp Tyr Gin Ser Ala Asp Gly Arg He Asp Thr 
450 455 460 

Leu Trp Pro Ala Ser Thr He Glu Tyr Trp Leu Arg Thr Arg Leu Phe 
465 470 475 " 480 

Arg Lys Ser Asp Phe His Ala Leu Thr Thr Gly Lys Gly 
485 4 90 

<210> 43 
<211> 1626 
<212> DNA 

<213> Rhodococcus erythropolis AN12 
<400> 43 

atgactacac aaaaggccct gaccactgtc gatgccatcg tcatcggcgc cggattcggc 60 

gggatctacg ccgtccacaa actggccaac gagctcggcc tcacgacggt cggcttcgac 120 

aaggcagacg gcccgggcgg cacgtggtac tggaaccgct acccgggtgc actgtccgac 180 

accgaaagcc acgtctaccg gttctcattc gaccgtgacc tgcttcagga cggtacctgg 240 

aagcacacct acaccactca acccgagatt ctcgaatacc ttgaggatgt cgtttcccgg 300 

ttcgacctac gccggcactt ccacttcggc actgccgtcg aatctgcggt gtatctcgaa 360 

gacgaacaac tgtgggaagt caccaccgac acaggcgaga tctaccgcgc tacctacgtc 420 

gtcaatgctg tcgggctcct ctccgccatc aatcgaccgg atctgcccgg tctcgagaca 480 

ttcgaaggcg agaccatcca caccgcagcg tggcccgagg gcaaggatct caccggccgc 540 

cgcgtcggcg tgatcggtac cggatctact gggcaacagg tcatcacggc cctggcgcca 600 

acggtcgaac acctcactgt attcgtgcga actccccagt actcggtgcc ggtcggcaag 660 

cgcgcggtga ccgacgagca gatcgacgca gtcaaagccg actacgagaa catctggact 720 

caggtcaaaa gatcctcggt ggcattcggc ttcgaggaat ctactgttcc ggccatgagc 780 

gtgtccgcgg aagaacgcct cagggtctac gaagaggcat gggagcaggg cggcggtttc 840 

cgattcatgt tcggaacctt cggtgacatc gctaccgacg aagaagccaa cgaaactgca 900 

gcatcgttca ttcgctcgaa gatcaccgcc atgatcgaag acccggagac tgcccgcaaa 960 

ctgacgccca ccggactatt cgcgagacga ccgttgtgcg acgacgggta cttccaggtc 1020 
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ttcaaccgcc 


cgaacgtcga ggcggtcgcc atcaaggaaa accccattcg tgagatcaca 


1080 


gccaagggcg 


tggtgaccga ggacggcgtc ctgcacaaat tggacgtcct ggtcctcgcc 


1140 


accggcttcg 


acgccgtcga cgggaactac cgccgcatga ccatttccgg tcgcggtggc 


1200 


ctgaacatca 


acgaccattg ggacggccaa cccaccagct acctggggat tgccaccgcg 


1260 


aacttcccca 


actggttcat ggtgctcggc cccaacggac cgttcacgaa ccttcctcca 


1320 


agcatcgaaa 


ctcaggtcga gtggatcagc gacaccatag gttacgtcga gcggacaggt 


1380 


gtgcgggcga 


tcgaacccac accggaggcg gaatccgcat ggaccgcgac ctgcacggac 


1440 


atcgcgaaca 


tgaccgtctfc caccaaggfcfc gattcatgga tcttcggggc caatgttcca 




ggaaagaagc 


ccagcgtgct gttctacctt ggcgggctcg gcaactaccg cgccgtcctg 


1560 


gcagacgtca 


ccgagggggg ctatcagggc tttgctctga agacggccga caccgtcgac 


1620 


gcctga 




1626 


<210> 44 






<211> 541 






<212> PRT 






<213> Rhod 


iococcus erythropolis AN12 





<400>' 44 

Met Thr Thr Gin Lys Ala Leu Thr Thr Val Asp Ala He Val He Gly 
15 10 15 

/ 

Ala Gly Phe Gly Gly He Tyr Ala Val His Lys Leu Ala Asn Glu Leu 
20 25 30 

Gly Leu Thr Thr Val Gly Phe Asp Lys Ala Asp Gly Pro Gly Gly Thr 
35 40 45 

Trp Tyr Trp Asn Arg Tyr Pro Gly Ala Leu Ser Asp Thr Glu Ser His 
50 55 60 

Val Tyr Arg Phe Ser Phe Asp Arg Asp Leu Leu Gin Asp Gly Thr Trp 
65 70 75 80 

Lys His Thr Tyr Thr Thr Gin Pro Glu He Leu Glu Tyr Leu Glu Asp 
85 90 95 

Val val Ser Arg Phe Asp Leu Arg Arg His Phe His Phe Gly Thr Ala 
100 105 110 
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Val Glu Ser Ala Val Tyr Leu Glu Asp Glu Gin Leu Trp Glu Val Thr 
115 120 125 



Thr Asp Thr Gly Glu lie Tyr Arg Ala Thr Tyr Val Val A3n Ala Val 
130 135 140 



Gly Leu Leu Ser Ala He Asn Arg Pro Asp Leu Pro Gly Leu Glu Thr 
145 150 155 160 



Phe Glu Gly Glu Thr He His Thr Ala Ala Trp Pro Glu Gly Lys Asp 
165 170 175 



Leu Thr Gly Arg Arg Val Gly Val He Gly Thr Gly Ser Thr Gly Gin 
180 185 190 



Gin Val He Thr Ala Leu Ala Pro Thr Val Glu His Leu Thr Val Phe 
195 200 205 



Val Arg Thr Pro Gin Tyr Ser Val Pro Val Gly Lys Arg Ala Val Thr 
210 215 220 



Asp Glu Gin He Asp Ala Val Lys Ala Asp Tyr Glu Asn He Trp Thr 
225 230 235 240 



Gin Val Lys Arg Ser Ser Val Ala Phe Gly Phe Glu Glu Ser Thr Val 
245 250 255 



Pro Ala Met Ser Val Ser Ala Glu Glu Arg Leu Arg Val Tyr Glu Glu 
260 265 270 



Ala Trp Glu Gin Gly Gly Gly Phe Arg Phe Met Phe Gly Thr Phe Gly 
275 280 285 



Asp He Ala Thr Asp Glu Glu Ala Asn Glu Thr Ala Ala Ser Phe He 
290 295 300 



Arg Ser Lys He Thr Ala Met He Glu Asp Pro Glu Thr Ala Arg Lys 
305 310 315 320 



Leu Thr Pro Thr Gly Leu Phe Ala Arg Arg Pro Leu Cys Asp Asp Gly 
325 330 335 



Tyr Phe Gin Val Phe Asn Arg Pro Asn Val Glu Ala Val Ala He Lys 
340 345 350 
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Glu Asn Pro lie Arg Glu lie Thr Ala Lys Gly Val Val Thr Glu Asp 
355 360 365 



Gly Val Leu His Lys Leu Asp Val Leu Val Leu Ala Thr Gly Phe Asp 
370 375 380 



Ala Val Asp Gly Asn Tyr Arg Arg Met Thr lie Ser Gly Arg Gly Gly 
385 390 395 " " 400 



Leu Asn lie Asn Asp His Trp Asp Gly Gin Pro Thr Ser Tyr Leu Gly 
405 410 415 



lie Ala Thr Ala Asn Phe Pro Asn Trp Phe Met Val Leu Gly Pro Asn 
420 425 430 



Gly Pro Phe Thr Asn Leu Pro Pro Ser He Glu Thr Gin Val Glu Trp 
435 440 445 



He Ser Asp Thr He Gly Tyr Val Glu Arg Thr Gly Val Arg Ala He 
450 455 460 



Glu Pro Thr Pro Glu Ala Glu Ser Ala Trp Thr Ala Thr Cys Thr Asp 
465 470 475 480 



He Ala Asn Met Thr Val Phe Thr Lys Val Asp Ser Trp He Phe Gly 
485 490 495 



Ala Asn Val Pro Gly Lys Lys Pro Ser Val Leu Phe Tyr Leu Gly Gly 
500 505 510 



Leu Gly Asn Tyr Arg Ala Val Leu Ala Asp Val Thr Glu Gly Gly Tyr 
515 520 525 



Gin Gly Phe Ala Leu Lys Thr Ala Asp Thr Val Asp Ala 
530 535 540 



<210> 45 

<211> 1638 

<212> DNA 

<213> Rhodococcus erythropolis AN12 



<400> 45 

atgacaacta ccgaatccag aactcagacc gacaaggctg gggccgtcac gctcgatgcg 60 
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ttgatcatcg gcgccggagt cgccggtttg tatcagctcc acatgcttcg cgagcaggga 120 

ctgaacgtcc gcgcctacga cgctgcggaa gacgtcggcg gtacgtggta ctggaaccgt 180 

tacccaggcg cacgattcga ctccgaagcc tacatctacc agtacctgtt ctccgaggac 24 0 

ctgtacaaga actggagctg gagtcaacgc ttcccggccc agcccgaaat tgagcggtgg 300 

atgcgctacg tcgccgacac cctggacctg cgtcgcagca ttcagttttc cacaacaatc 360 

accagcgccg agttcgacga ggtagctgag cgttggacca ttcgcaccga ccgcggcgag 420 

gaaatcagca cccgattctt catcacctgt tgcggaatgc tgtcggcgcc gatggaagat 480 

ttgttccccg gacaacagga cttccggggg cagatcttcc acacctcgcg atggccgcac 540 

ggagatgtag aactcaccgg taagcgtgtc ggtgtcgtcg gcgtcggcgc cactggcatt 600 

caggtaatcc agaccatcgc cgacgaggtt gatcaactga aggtgttcgt gcggacaccc 660 

cagtacgcct tgccgatgaa aaaccctcag tacgacagcg acgacgtcgc ggcctacaag 720 

gaccgattcg aggagcttcg aaccacactg ccgcacacct tcacaggctt cgaatacgat 780 

ttcgaatacg tgtgggccga cctagccccc gaacagcgcc gcgaggtgct cgagaacatc 840 

tacgagtacg gatcactcaa gctgtggctg tcgtcgttcg cggagatgtt cttcgatgag 900 

caggtcagtg acgagatctc cgagttcgtt cgcgagaaaa tgcgggcgcg gctcatcgat 960 

ccggagctgt gcgacctgct gattcccact gactatggct tcggcacaca ccgtgtgccg 1020 

ctcgaaacca actacctcga ggtgtaccac cgcccgaatg tgacggccat cggcgtcaag 1080 

aacaacccga tcgcgcgaat cgtcccccaa ggcatcgagt tgaccgacgg taccttccac 1140 

gaactagacg tgatcatttt ggccactggg ttcgatgcag gcaccggcgc actgactcga 1200 

atcgacatcc gcggccgcgg tggtcggtct ctgaaggaag actggggacg cgatattcgc 1260 

acgacaatgg gcctgatggt gcacggttac ccgaacatgc tgacgaccgc cgtgcccctg 1320 

gcaccctccg cggcactgtg caacatgacc acgtgcttgc agcagcagac cgagtggatc 1380 

agcgaagcaa ttcgctacat gcaagagcgc gatctgaccg tcatcgagcc taccaaggag 1440 

gccgaggacg cgtgggtggc gcaccacgac gaaacagccg cagtgaatct gatctccaag 1500 

acggattcct ggtacgtagg ttccaacgtt ccagggaagc cgcgacgggt cctgtcctac 1560 

acggggggag tcggcgcata ccgagaaaag gcgcaggaaa tcgccgacgc cggatacaag 1620 

ggcttcaatc tgcgctga 1638 



<210> 46 
<211> 545 



WO 03/020890 



PCIVUS02/27549 



<212> PRT 

<213> Rhodococcus erythropolis AN12 
<400> 46 

Met Thr Thr Thr Glu Ser Arg Thr Gin Thr Asp Lys Ala Gly Ala Val 
15 10 15 

Thr Leu Asp Ala Leu He He Gly Ala Gly Val Ala Gly Leu Tyr Gin 
20 25 30 

Leu His Met Leu Arg Glu Gin Gly Leu Asn Val Arg Ala Tyr Asp Ala 
35 40 45 

Ala Glu Asp Val Gly Gly Thr Trp Tyr Trp Asn Arg Tyr Pro Gly Ala 



Arg Phe Asp Ser Glu Ala Tyr He Tyr Gin Tyr Leu Phe Ser Glu Asp 
65 70 75 80 



Leu Tyr Lys Asn Trp Ser Trp Ser Gin Arg Phe Pro Ala Gin Pro Glu 
85 90 95 



He Glu Arg Trp Met Arg Tyr Val Ala Asp Thr Leu Asp Leu Arg Arg 
100 105 110 



Ser He Gin Phe Ser Thr Thr He Thr Ser Ala Glu Phe Asp Glu Val 
115 120 125 



Ala Glu Arg Trp Thr He Arg" Thr Asp Arg Gly Glu Glu He Ser Thr 
130 135 140 



Arg Phe Phe He Thr Cys Cys Gly Met Leu Ser Ala Pro Met Glu Asp 
145 150 155 160 



Leu Phe Pro Gly Gin Gin Asp Phe Arg Gly Gin He Phe His Thr Ser 
165 170 175 



Arg Trp Pro His Gly Asp Val Glu Leu Thr Gly Lys Arg Val Gly Val 
180 185 190 



Val Gly Val Gly Ala Thr Gly He Gin Val He Gin Thr He Ala Asp 
195 200 205 



Glu Val Asp Gin Leu Lys Val Phe Val Arg Thr Pro Gin Tyr Ala Leu 
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Pro Met Lys Asn Pro Gin Tyr Asp Ser Asp Asp Val Ala Ala Tyr Lys 
225 230 235 240 



Asp Arg Phe Glu Glu Leu Arg Thr Thr Leu Pro His Thr Phe Thr Gly 
245 250 255 



Phe Glu Tyr Asp Phe Glu Tyr Val Trp Ala Asp Leu Ala Pro Glu Gin 
260 265 270 



Arg Arg Glu Val Leu Glu Asn He Tyr Glu Tyr Gly Ser Leu Lys Leu 
275 280 285 



Trp Leu Ser Ser Phe Ala Glu Met Phe Phe Asp Glu Gin Val Ser Asp 
290 295 300 



Glu He Ser Glu Phe Val Arg Glu Lys Met Arg Ala Arg Leu He Asp 
305 310 315 320 



Pro Glu Leu Cys Asp Leu Leu He Pro Thr Asp Tyr Gly Phe Gly Thr 
325 330 335 



His Arg Val Pro Leu Glu Thr Asn Tyr Leu Glu Val Tyr His Arg Pro 
340 345 350 



Asn Val Thr Ala He Gly Val Lys Asn Asn Pro He Ala Arg He Val 
355 360 365 



Pro Gin Gly He Glu Leu Thr Asp Gly Thr Phe His Glu Leu Asp Val 
370 375 380 



He He Leu Ala Thr Gly Phe Asp Ala Gly Thr Gly Ala Leu Thr Arg 
385 390 395 400 



He Asp He Arg Gly Arg Gly Gly Arg Ser Leu Lys Glu Asp Trp Gly 
405 410 415 



Arg Asp He Arg Thr Thr Met Gly Leu Met Val His Gly Tyr Pro Asn 
420 425 430 



Met Leu Thr Thr Ala Val Pro Leu Ala Pro Ser Ala Ala Leu Cys Asn 
435 440 445 



Met Thr Thr Cys Leu Gin Gin Gin Thr Glu Trp lie Ser Glu Ala He 
450 455 460 
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Arg Tyr Met Gin Glu Arg Asp Leu Thr Val lie Glu Pro Thr Lys Glu 
465 470 475 480 



Ala Glu Asp Ala Trp Val Ala His His Asp Glu Thr Ala Ala Val Asn 
485 490 495 



Leu lie Ser Lys Thr Asp Ser Trp Tyr Val Gly Ser Asn Val Pro Gly 
500 505 510 



Lys Pro Arg Arg Val Leu Ser Tyr Thr Gly Gly Val Gly Ala Tyr Arg 
515 520 525 



Glu Lys Ala Gin Glu lie Ala Asp Ala Gly Tyr Lys Gly Phe Asn Leu 
530 535 540 



<210> 47 
<211> 540 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> consensus sequence 
<400> 47 

Met Thr Ala Gin Glu Ser Leu Thr Val Val Asp Ala Val Val lie Gly 
15 10 15 

Ala Gly Phe Gly Gly lie Tyr Ala Val His Lys Leu Arg Glu Gin Gly 
20 25 30 

Leu Thr Val Val Gly Phe Asp Ala Ala Asp Gly Pro Gly Gly Thr Trp 
35 40 45 

Tyr Trp Asn Arg Tyr Pro Gly Ala Leu Ser Asp Thr Glu Ser His Val 
50 55 60 

Tyr Arg Phe Ser Phe Asp Glu Asp Leu Leu Gin Asp Trp Thr Trp Lys 
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Glu Thr Tyr Pro Thr Gin Pro Glu lie Leu Glu Tyr Leu Glu Asp Val 



Val Asp Arg Phe Asp Leu Arg Arg Asp Phe Arg Phe Gly Thr Glu Val 
100 105 110 



Thr Ser Ala Thr Tyr Leu Glu Asp Glu Asn Leu Trp Glu Val Thr Thr 
115 120 125 



Asp Gly Gly Glu Val Tyr Arg Ala Arg Phe Val Val Asn Ala Val Gly 
130 135 140 



Leu Leu Ser Ala lie Asn Phe Pro Asn lie Pro Gly Leu Asp Thr Phe 
145 150 155 " 160 



Glu Gly Glu Thr lie His Thr Ala Ala Trp Pro Glu Gly Val Asp Leu 
165 170 175 



Thr Gly Lys Arg Val Gly Val lie Gly Thr Gly Ser Thr Gly lie Gin 
180 185 190 



Val He Thr Ala Leu Ala Pro Glu Val Glu His Leu Thr Val Phe Val 
195 200 205 



Arg Thr Pro Gin Tyr Ser Val Pro Val Gly Asn Arg Pro Val Thr Ala 
210 215 220 



Glu Gin He Asp Ala He Lys Ala Asp Tyr Asp Glu He Trp Ala Gin 
225 230 235 240 



Val Lys Arg Ser Gly Val Ala Phe Gly Phe Glu Glu Ser Thr Val Pro 
245 250 255 



Ala Met Ser Val Ser Glu Glu Glu Arg Asn Arg Val Phe Glu Glu Ala 
260 265 270 



Trp Glu Glu Gly Gly Gly Phe Arg Phe Met Phe Gly Thr Phe Gly Asp 
275 280 285 



He Ala Thr Asp Glu Ala Ala Asn Glu Thr Ala Ala Ser Phe He Arg 
290 295 300 



Ser Lys He Arg Glu He Val Lys Asp Pro Glu Thr Ala Arg Lys Leu 
305 310 315 320 
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Thr Pro Thr Gly Leu Phe Ala Arg Arg Arg Leu Cys Asp Asp Gly Tyr 
325 330 335 



Tyr Glu Val Tyr Asn Arg Pro Asn Val Glu Ala Val Asp He Lys Glu 
340 345 350 



Asn Pro He Arg Glu He Thr Ala Lys Gly Val Val Thr Glu Asp Gly 
355 360 365 



Val Leu His Glu Leu Asp Val Leu Val Phe Ala Thr Gly Phe Asp Ala 
370 375 380 



Val Asp Gly Asn Tyr Arg Arg He Asp He Arg Gly Arg Gly Gly Leu 
385 390 395 " ~* 400 



Ser Leu Asn Asp His Trp Asp Gly Gin Pro Thr Ser Tyr Leu Gly Leu 
405 410 415 



Ser Thr Ala Gly Phe Pro Asn Trp Phe Met Val Leu Gly Pro Asn Gly 
420 425 430 



Pro Phe Thr Asn Leu Pro Pro Ser He Glu Thr Gin Val Glu Trp He 
435 440 445 



Ser Asp Thr He Ala Tyr Ala Glu Glu Asn Gly He Arg Ala He Glu 
. 450 455 460 



Pro Thr Pro Glu Ala Glu Asp Glu Trp Thr Ala Thr Cys Thr Asp He 
465 470 475 480 



Ala Asn Ala Thr Leu Phe Thr Lys Ala Asp Ser Trp He Phe Gly Ala 
485 490 495 ■ 



Asn Val Pro Gly Lys Lys Pro Ser Val Leu Phe Tyr Leu Gly Gly Leu 
500 505 510 



Gly Asn Tyr Arg Ala Val Leu Ala Asp Val Ala Ala Ala Gly Tyr Arg 
515 520 525 



Gly Phe Ala Leu Lys Ser Ala Asp Ala Val Thr Ala 
530 535 540 



<210> 48 
<211> 497 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> consensus sequence 
<220> 

<221> MISC_FEATURE 

<222> (3).. (3) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (6).. (6) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (9).. (9) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (30).. (30) 

<223> G or A or T or C 

<220> 

<221> MIS C_FEATURE 

<222> (63).. (63) 

<223> G or A or T or C 
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<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

c220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



MISC_FEATORE 
(81) . . (81) 
G or A or T or C 



MIS C_FEATURE 
(95).. (95) 
G or A or T or C 



MISC_FEATURE 
(96) . . (96) 
G or A or T or C 



MISC_FEATURE 
(99) . . (99) 
G or A or T or C 



MIS C_FEATURE 
(100) . . (100) 
G or A or T or C 



MISC_FEATURE 
(123) . . (123) 
G or A or T or C 
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<220> 

<221> MISC_FEATURE 

<222> (132).. (132) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (143).. (143) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (158).. (158) 

<223> G or A or T or C 

<220> 

<221> MISC_PEATURE 

<222> (159).. (159) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURB 

<222> (164).. (164) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (195) . . (195) 

<223> G or A or T or C 



PCT/US02/27549 
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<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221* 
— <222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



MISC_FEATURE 
(197) . . (197) 
G or A or T or C 



MISC_FEATURE 
(215) . . (215) 
G or A or T or C 



MIS C_FEATURE 
(219) . . (219) 
G or A or T or C 



MISC_FEATURE 
(221) . . (221) 
G or A or T or C 



MIS C_FEATURE 
(237) . . (237) 
G or A or T or C 



MISC_FEATORE 
(252) . . (252) 
G or A or T or C 
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<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



MISC_FEATURE 
(254) . . (254) 
G or A or T or C 



MISC_FEATURE 
(259) . . (259) 
G or A or T or C 



MIS C_FE ATURE 
(260) . . (260) 
G or A or T or C 



MISC_FEATURE 
(261) . . (261) 
G or A or T or C 



MISC_FEATURE 
(279) . . (279) 
G or A or T or C 



MISC_FEATURE 
(303) . . (303) 
G or A or T or C 
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<220> 

<221> MIS C_FEATURE 
<222> (320).. (320) 

<223> G or A or T or C 

<220> 

<221> MIS C_FEATURE 
<222> (343) . . (343) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (346).. (346) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (348).. (348) 

<223> G or A or T or C 

<220> 

<221> MIS C_F EATURE 

<222> (369) . . (369) 

<223> G or A or T or C 

<220> 

<221> MISCJFEATURE 

<222> (384).. (384) 

<223> G or A or T or C 
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<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



MIS C_FEATURE 
(399) . . (399) 
G or A or T or C 



MISC_FEATURE 
(413) .. (413) 
G or A or T or C 



MISC_FEATURE 
(414).. (414) 
G or A or T or C 



MISC_FEATURE 
(431) . . (431) 
G or A or T or C 



MISC_FEATURE 
(432) . . (432) 
G or A or T or C 



MISC_FEATURE 
(435).. (435) 
G or A or T or C 
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<220> 

<221> MIS C_FEATURE 

<222> (456).. (456) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (465).. (465) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (471) . . (471) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATORE 

<222> (472).. (472) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (486).. (486) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (495).. (495) 

<223> G or A or T or C 
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<220> 

<221> MISC_FEATURE 
<222> (496).. (495) 
<223> G or A or T or C 

<400> 48 

Met Val Xaa lie Pro Xaa Arg His Xaa Glu Val Val He He Gly Ala 
15 10 15 

Gly Phe Ala Gly He Gly Ala Ala Val Glu Leu Lys Arg Xaa Gly He 
20 25 30 

Asp Asp Phe Val Leu Leu Glu Arg Ala Asp Asp Val Gly Gly Thr Trp 
35 40 45 

Arg Asp Asn Thr Tyr Pro Gly Ala Ala Cys Asp Val Pro Ser Xaa Leu 
50 55 60 

Tyr Ser Tyr Ser Phe Ala Pro Asn Pro Asn Trp Thr Arg Leu Phe Ala 
65 70 75 80 

Xaa Gin Pro Glu He Tyr Asp Tyr Leu Glu Asp Val Ala Ala Xaa Xaa 



Gly Leu Xaa Xaa His Val Arg Phe Gly Val Glu Val Thr Glu Ala Arg 
100 105 110 



Trp Asp Glu Ser Ala Gin Leu Trp Arg Val Xaa Thr Ala Ser Gly Glu 
115 120 125 



Leu Thr Ala Xaa Phe Leu Val Ala Ala Thr Gly Pro Leu Ser Xaa Pro 
130 135 140 



Lys He Pro Asp Leu Pro Gly Leu Glu Ser Phe Glu Gly Xaa Xaa Phe 
145 150 155 160 



His Ser Ala Xaa Trp Asn His Asp Leu Asp Leu Arg Gly Glu Arg Val 
165 170 175 



Ala Val Val Gly Thr Gly Ala Ser Ala Val Gin Phe Val Pro Glu He 
180 185 190 



Ala Asp Xaa Ala Xaa Thr Leu Thr Val Phe Gin Arg Thr Pro Gin Trp 
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Val Leu Pro Arg Pro Asp Xaa Thr Leu Pro Xaa Ala Xaa Arg Ala Val 
210 215 220 



Phe Ser Arg Val Pro Gly Thr Gin Lys Trp Leu Arg Xaa Arg Leu Tyr 
225 230 235 240 



Gly lie Phe Glu Ala Leu Gly Ser Gly Phe Val Xaa Pro Xaa Trp Leu 
245 250 255 



Leu Pro Xaa Xaa Xaa Ala Leu Ala Arg Ala His Leu Arg Arg Gin Val 
260 265 270 



Arg Asp Pro Glu Leu Arg Xaa Lys Leu Thr Pro Asp Tyr Thr Pro Gly 
275 280 285 



Cys Lys Arg Met Leu Leu Ser Asn Asp Trp Tyr Pro Ala Leu Xaa Lys 
290 295 300 



Pro Asn Val Ser Leu Val Thr Ser Gly Val Val Glu Val Thr Glu Xaa 
305 310 315 320 



Gly Val Val Asp Ala Asp Gly Val Glu His Glu Val Asp Thr He He 
325 330 335 



Phe Ala Thr Gly Phe His Xaa Thr Asp Xaa Pro Xaa Ala Met Lys He 
340 345 350 



Phe Gly Arg Glu Gly Arg Ser Leu Ala Asp His Trp Asn Gly Ser Ala 
355 360 365 



Xaa Ala Tyr Leu Gly Thr Ala Val Ser Gly Phe Pro Asn Leu Phe Xaa 
370 375 380 



Leu Leu Gly Pro Asn Thr Gly Leu Gly His Thr Ser He Val Xaa He 
385 390 395 400 



Leu Glu Ala Gin Ala Glu Tyr He Ala Ser Ala Leu Xaa Xaa Met Arg 
405 410 415 



Arg Glu Gly Leu Gly Ala Leu Asp Val Arg Ala Glu Val Gin Xaa Xaa 
420 425 430 



Phe Asn Xaa Ala Val Gin Glu Arg Leu Ala Thr Thr Val Trp Asn Ala 
435 440 445 
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Gly Gly Cys Ser Ser Trp Tyr Xaa Asp Pro Asp Gly Arg Asn Ser Thr 
450 455 460 

Xaa Trp Pro Trp Ser Thr Xaa Xaa Phe Arg Ala Arg Thr Arg Arg Phe 
465 470 475 480 

Asp Pro Ser Asp Tyr Xaa Pro Ser Ser Pro Thr Pro Glu Thr Xaa Xaa 
485 490 495 

Gly 

<210> 49 

<211> 471 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> consensus sequence 
<220> 

<221> MISC_FEATURE 

<222> (22).. (22) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (25).. (25) 

<223> G or A or T or C 

<220> 

<221> MIS C_FEATURE 

<222> (28).. (28) 

<223> G or A or T or C 
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<220> 

<221> MISC_FEATURE 

<222> (31).. (31) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (62).. (62) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (72).. (72) 

<223> G or A or T or C 

<220> 

<221> MIS C_FEATURE 

<222> (75).. (75) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (85).. (85) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATORE 

<222> (86).. (86) 

<223> G or A or T or C 
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<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<22X> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



MISC_FEATORE 
(93) . . (93) 
G or A or T or C 



MISC_FEATURE 
(95).. (95) 
G or A or T or C 



MISC_FEATURE 
(99) • • (99) 
G or A or T or C 



MISC_FEATURE 
(102) . . (102) 
G or A or T or C 



MISC_FKATURE 
(111) . . (Ill) 
G or A or T or C 



MISC_FEATURE 
(112) . . (112) 
G or A or T or C 
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<220> 

<221> MISCJFEATURE 

<222> (113).. (113) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (115).. (115) 

<223> G or A or T or C 

<220> 

<221> MISC_FBATURE 

<222> (123).. (123) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (124).. (124) 

<223> G or A or T or C 
<220> 

<221> MIS C_FEATURE 

<222> (125).. (125) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (131).. (131) 

<223> G or A or T or C 
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<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



MISC_FEATURE 
(137) . . (137) 
G or A or T or C 



MISC_FEATURE 
(138) . . (138) 
G or A or T or C 



MISC_FEATURE 
(158) . . (158) 
G or A or T or C 



MIS C_FEATURE 
(160) . . (160) 
G or A or T or C 



MIS C_FE ATURE 
(166) . . (166) 
G or A or T or C 



MISC_FEATURE 
(170) . . (170) 
G or A or T or C 
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<220> 

<221> MISC_FEATURE 
<222> (188).. (188) 

<223> G or A or T or C 

<220> 

<221> MIS C_FEATURE 

<222> (193).. (193) 

<223> G or A or T or C 

<220> 

<221> MIS C_FEATORE 

<222> (195).. (195) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (196).. (196) 

<223> G or A or T or C 

<220> 

<221> MISC_FKATURE 

<222> (197).. (197) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (199).. (199) 

<223> G or A or T or C 



98 



WO 03/020890 



PCT/US02/27549 



<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



MISC_FEATURS 
(221) . . (221) 
G or A or T or C 



MISC_FEATURE 
(226) .. (226) 
G or A or T or C 



MI SC_FEATURE 
(228) . . (228) 
G or A or T or C 



MISC_FEATURE 
(229) . . (229) 
G or A or T or C 



MIS C_FEATURE 
(230) . . (230) 
G or A or T or C 



MI SC_FEATURE 
(231) . . (231) 
G or A or T or C 
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<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



MISC_FEATURE 
(232) . . (232) 
G or A or T or C 



MISCJFEATURE 
(235) . . (235) 
G or A or T or C 



MIS C_FEATURE 
(236) . . (236) 
G or A or T or C 



MISC_FEATURE 
(240) . . (240) 
G or A or T or C 



MIS C_FEATURE 
(244) . . (244) 
G or A or T or C 



MISC_FEATURE 
(249) . . (249) 
G or A or T or C 
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<220> 

<221> MIS C_FE ATURE 
<222> (262) . . (262) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 
<222> (264).. (264) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (272).. (272) 

<223> G or A or T or C 

<220> 

<221> MI SC_FEATURE 

<222> (289) . . (289) 

<223> G or A or T or C 

<220> 

<221> MIS C_FE ATURE 

<222> (296).. (296) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATTJRE 

<222> (298).. (298) 

<223> G or A or T or C 
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<220> 

<221> MISC_FEATTJRE 
<222> (310) . . (310) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 
<222> (317).. (317) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (330).. (330) 

<223> G or A or T or C 

<220> 

<221> MIS C_FEATURE 

<222> (331).. (331) 

<223> G or A or T or C 

<220> 

<221> MIS C_FEATURE 

<222> (332).. (332) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (338).. (338) 

<223> G or A or T or C 
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<220> 

<221> MISC_FEATORE 

<222> (339) . . (339) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (347).. (347) 

<223> G or A or T or C 

<22Q> 

<221> MISC_FEATURE 

<222> (348).. (348) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (350) .. (350) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (356) . . (356) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (357) . . (357) 

<223> G or A or T or C 
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<220> 

<221> MIS C_FEATORE 
<222> (358).. (358) 

<223> G or A or T or C 

<220> 

<221> MIS C_FEATURE 

<222> (359).. (359) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (368).. (368) 

<223> G or A or T or C 

<220> 

<221> MIS C_FEATURE 

<222> (384).. (384) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (392).. (392) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (393).. (393) 

<223> G or A or T or C 
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<220> 

<221> MISC_FHATURE 

<222> (395).. (395) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (396).. (396) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (400).. (400) 

<223> G or A or T or C 

<220> 

<221> MIS C_FEATURE 

<222> (401).. (401) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (402). .(402) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (403) . . (403) 

<223> G or A or T or C 
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<220> 

<221> MISC_FEATURE 

<222> (404).. (404) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (407).. (407) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (410).. (410) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (411) . . (411) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (412) . . (412) 

<223> G or A or T or C 

<220> 

<221> MIS C_FEATURE 

<222> (413).. (413) 

<223> G or A or T or C 
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<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



MIS C_FEATURE 
(421) . . (421) 
G or A or T or C 



MIS C_FEATURE 
(423). . (423) 
G or A or T or C 



MIS C_FEATURE 
(424) . . (424) 
G or A or T or C 



MISC_FEATURE 
(426) . . (426) 
G or A or T or C 



MISC_FEATURE 
(429) . . (429) 
G or A or T or C 



MISC_FEATURE 
(432) . . (432) 
G or A or T or C 
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<220> 

<221> MISC_FEATORE 

<222> (434).. (434) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (435).. (435) 

<223> G or A or T or C 

<220> 

<221> MI SC_FEATURE 

<222> (437).. (437) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (438).. (438) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (439).. (439) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (440).. (440) 

<223> G or A or T or C 
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<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



MISC_FEATDRE 
(443) . . (443) 
G or A or T or C 



MISC_FEATURE 
(444) . . (444) 
G or A or T or C 



MISC_FEATURE 
(447) . . (447) 
G or A or T or C 



MISC_FEATORE 
(449) . . (449) 
G or A or T or C 



MISC_FEATURE 
(450) . . (450) 
G or A or T or C 



MISC_FEATUKE 
(451) . . (451) 
G or A or T or C 
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<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



MISCJFEATURE 
(452) . . (452) 
G or A or T or C 



MISC_FEATURE 
(453) .. (453) 
G or A or T or C 



MISC_FEATURE 
(454) .. (454) 
G or A or T or C 



MIS C_FEATURE 
(455) .. (455) 
G or A or T or C 



MISC_FEATURE 
(456) .. (456) 
G or A or T or C 



MISC_FEATURE 
(457) . . (457) 
G or A or T or C 
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<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



MISC_FEATURE 
(458) . . (458) 
G or A or T or C 



MISCJFEATURE 
(459) .. (459) 
G or A or T or C 



MISC_FEATURE 
(460) . . (460) 
G or A or T or C 



MISC_FEATURE 
(464) . . (464) 
G or A or T or C 



MIS C_FEATURE 
(465) .. (465) 
G or A or T or C 



MISC_FEATURE 
(466) . . (466) 
G or A or T or C 
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<220> 

<221> MISC_FEATORE 

<222> (468) . . (468) 

<223> G or A or T or C 

<220> 

<221> MISC_PEATURS 

<222> (469) . . (469) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (470) . . (470) 

<223> G or A or T or C 

<220> 

<221> MISC_FEATURE 

<222> (471) . . (471) 

<223> G or A or T or C 

<400> 49 

Met Ser Thr Glu His Leu Asp Val Leu lie lie Gly Ala Gly Leu Ser 
15 10 15 

Gly lie Gly Ala Ala Xaa Arg Leu Xaa Arg Glu Xaa Gly lie Xaa Phe 
20 25 30 

Ala lie Leu Glu Ala Arg Asp Asn Val Gly Gly Thr Trp Asp Leu Phe 
35 40 45 

Asn Tyr Pro Gly lie Arg Ser Asp Ser Asp His Leu Thr Xaa Gly Lys 
50 55 60 

Gly Ala Phe Arg Pro Phe Pro Xaa Ala Lys Xaa Leu Ala Asp Gly Pro 
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Ser His Glu Leu Xaa Xaa Tyr Val Arg Asp Thr Ala Xaa Glu Xaa Gly 



Leu Arg Xaa His lie Xaa Phe Gly Thr Lys Val Val Ala Ala Xaa Xaa 
100 105 110 



Xaa Ala Xaa Ser Leu Trp Thr Val Thr Val Xaa Xaa Xaa Gly Glu Thr 
115 120 125 



Glu Val Xaa Thr Tyr Asn Val Leu Xaa Xaa Ala Asn Gly Tyr Tyr Ser 
130 135 140 



Tyr Asp Lys Gly Asn lie Pro Asp Phe Pro Gly Glu Phe Xaa Gly Xaa 
145 150 155 160 



Leu Val His Pro Gin Xaa Tyr Pro Glu Xaa Leu Asp Tyr Arg Gly Lys 
165 170 175 



Lys Val Val Val lie Gly Ser Gly Ala Ser Gly Xaa Thr Leu Ala Pro 
180 185 190 



Xaa Met Xaa Xaa Xaa Ala Xaa His Val Thr Met Leu Gin Arg Ser Gly 
195 200 205 



Thr Tyr He Ala Leu Pro Ser Asp Ala Val Val Pro Xaa Gin Leu Ala 
210 215 220 



Gly Xaa Arg Xaa Xaa Xaa Xaa Xaa Leu Gin Xaa Xaa Gin Leu Arg Xaa 
225 230 235 240 



Leu Gly Lys Asn Val Xaa Leu Xaa Gly Phe Pro Thr Pro Ser Tyr Xaa 
260 265 270 



Pro Trp Asp Gin His Leu Cys Val Val Pro Asn Gly Asp Leu Leu Lys 
275 280 285 



Xaa Leu Gly Ser Gly Asp Ala Xaa He Xaa Thr Asp He Asp Thr Phe 
290 295 300 



Thr Gly Lys Gly Val Xaa Phe Ala Ser Gly Arg Glu Xaa Asp Ala Asp 
305 310 315 320 
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Val Val Val Thr Ala Thr Gly Leu Asn Xaa Xaa Xaa Gly Gly Pro Phe 
325 330 335 



He Xaa Xaa Asp Gly Leu Leu Val Asp Leu Xaa Xaa Arg Xaa Ala Leu 
340 345 350 



Phe Tyr Lys Xaa Xaa Xaa Xaa Ser Asp Asn Leu Asn Phe Leu Gly Xaa 
355 360 365 



Val Gly Tyr Thr Asn Ala Ser Trp Thr Leu Arg Ala Asp Leu Ala Xaa 
370 375 380 



Leu Val Ala Cys Arg Leu Leu Xaa Xaa Met Xaa Xaa Arg Ser Ala Xaa 
385 390 395 400 



Xaa Xaa Xaa Xaa His Ala Xaa Ala Glu Xaa Xaa Xaa Xaa Leu Leu Ala 
405 410 415 



Ser Gly Tyr Lys Xaa Arg Xaa Xaa Gly Xaa Met Pro Xaa Gin Gly Xaa 
420 425 430 



Lys Xaa Xaa Trp Xaa Xaa Xaa Xaa Asn Tyr Xaa Xaa Asp Arg Xaa Leu 
435 440 445 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Ser Lys Xaa 
450 455 460 



Xaa Xaa Ala Xaa Xaa Xaa Xaa 
465 470 



<210> 50 

<211> 19 

<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Primer HK12 

<400> 50 

gagtttgatc ctggctcag 
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<210> 51 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 51 

caggmgccgc ggtaatwc 18 

<210> 52 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer HK21 

<400> 52 

gctgcctccc gtaggagt 18 

<210> 53 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 53 

ctaccagggt aactaatcc 19 

<210> 54 

<211> 15 

<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 54 
acgggcggtg tgtac 

<210> 55 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 55 

cacgagctga cgacagccat 

<210> 56 

<211> 16 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer HK13 

<400> 56 
taccttgtta cgactt 

<210> 57 

<211> 18 

<212> DNA 

<213> Artificial Sequence 



<223> Primer 
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<400> 57 

gwattaccgc ggckgctg 

<210> 58 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 58 

ggattagata ccctggtag 

<210> 59 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 59 

atggctgtcg tcagctcgtg 

<210> 60 

<211> 16 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer HK15 

<400> 60 
gcccccgyca attcct 

<210> 61 



19 



20 



16 
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<211> 17 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer HK14 

<400> 61 

gtgccagcag yragcggt 17 

<210> 62 

<211> 16 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer JCR15 

<400> 62 

gccagcagcc gcggta 16 

<210> 63 

<211> 17 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 63 

cggagcagat cgaww 17 

<210> 64 

<211> 17 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> M13 Reverse Primer 

<400> 64 

caggaaacag ctatgac 

<210> 65 

<211> 16 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> M13 (-20) Forward Primer 

<400> 65 
ctggccgtcg ttttac 

<210> 66 

<211> 34 

<212> DNA 

<213> Acinetobacter sp. NCIB 9871 

<400> 66 

gagtctgagc atatgtcaca aaaaatggat tttg 

<210> 67 

<211> 39 

<212> DNA 

<213> Acinetobacter sp. NCIB 9871 



<400> 67 

gagtctgagg gatccttagg cattggcagg ttgcttgat 



<210> 68 
<211> 25 
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<212> DNA 

<213> Brevibacterium sp. HCU 



<400> 68 

atgccaatta cacaacaact tgacc 



<210> 69 

<211> 23 

<212> DNA 

<213> Brevibacterium sp. 



<400> 69 

ctatttcata cccgccgatt cac 



<210> 70 

<211> 22 

<212> DNA 

<213> Brevibacterium sp. HCU 



<400> 70 

atgacgtcaa ccatgcctgc ac 



<210> 71 

<211> 21 

<212> DNA 

<213> Brevibacterium sp. HCU 



<400> 71 

cacttaagtc gcattcagcc c 



<210> 72 

<211> 21 

<212> DNA 

<213> Acinetobacter sp. SE19 
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<400> 72 

atggattttg atgctatcgt g 

<210> 73 
<211> 19 
<212> DNA 

<213> Acinetobacter sp. SE19 
<400> 73 

ggcattggca ggttgcttg 

<210> 74 

<211> 22 

<212> DNA 

<213> Arthrobacter sp. BP2 



<400> 74 

atgactgcac agaacacttt cc 



<210> 75 

<211> 18 

<212> DNA 

<213> Arthrobacter sp. BP2 

<400> 75 

tcaaagccgc ggtatccg 

<210> 76 

<211> 23 

<212> DNA 

<213> Rhodococcus sp. phil 



<400> 76 

atgactgcac agatctcacc cac 
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<210> 77 

<211> 22 

<212> DNA 

<213> Rhodococcus sp. phil 

<400> 77 

tcaggcggtc accgggacag eg 

<210> 78 

<211> 23 

<212> DNA 

<213> Rhodococcus sp. phi2 

<400> 78 

atgaccgcac agaccatcca cac 

<210> 79 

<211> 20 

<212> DNA 

<213> Rhodococcus sp. phi2 



<400> 79 

teagacegtg accatctcgg 



<210> 80 

<211> 21 

<212> DNA 

<213> Brachymonas sp. 



<400> 80 

atgtcttcct cgccaagcag c 



<210> 81 
<211> 21 
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<2X2> DNA 

<213> Brachymonas sp. CHX 
<400> 81 

cagtggttgg aacgcaaagc c 

<210> 82 

<211> 23 

<212> DNA 

<213> Rhodococcus erythropolis AN12 

<400> 82 

atgagcacag agggcaagta cgc 

<210> 83 

<211> 25 

<212> DNA 

<213> Rhodococcus erythropolis AN12 

<400> 83 

tcagtccttg ttcacgtagt aggcc 

<210> 84 

<211> 23 

<212> DNA 

<213> Rhodococcus erythropolis AN12 



<400> 84 

atggtcgaca tcgacccaac etc 



<210> 85 

<211> 24 

<212> DNA 

<213> Rhodococcus erythropolis AN12 
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<400> 85 

ttatcggctc ctcacggttt ctcg 

<210> 86 
<211> 24 
<212> DNA 

<213> Rhodococcus erythropolis AN12 
<400> 86 

atgaccgatc ctgacttctc cacc 

<210> 87 
<211> 24 
<212> DNA 

<213> Rhodococcus erythropolis AN12 
<400> 87 

tcatgcgtgc accgcactgt tcag 

<210> 88 
<211> 23 
<212> DNA 

<213> Rhodococcus erythropolis AN12 
<400> 88 

atgagcccct cccccttgcc gag 

<210> 89 

<211> 24 

<212> DNA 

<213> Rhodococcus erythropolis AN12 



<400> 89 

tcatgcgcga tccgccttct cgag 
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<210> 90 

<211> 24 

<212> DNA 

<213> Rhodococcus erythropolis AN12 

<400> 90 

gtgaacaacg aatctgacca cttc 

<210> 91 

<211> 23 

<212> DNA 

<213> Rhodococcus erythropolis AN12 

<400> 91 

tcatgcggtg tactccggtt ccg 

<210> 92 

<211> 22 

<212> DNA 

<213> Rhodococcus erythropolis AN12 



<400> 92 

atgagcaccg aacacctcga tg 



<210> 93 

<211> 23 

<212> DNA 

<213> Rhodococcus erythropolis AN12 



<400> 93 

tcaactcttg ctcggtaccg t 



<210> 94 
<211> 26 
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<212> DNA 

<213> Rhodococcus erythropolis AN12 



<400> 94 

atgacagacg aattcgacgt agtgat 



<210> 95 

<211> 23 

<212> DNA 

<213> Rhodococcus erythropolis AN12 



<400> 95 

tcagctctgg ttcacaggga egg 



<210> 96- 

<211> 23 

<212> DNA 

<213> Rhodococcus erythropolis AN12 



<400> 96 

atggcggaga tagtcaatgg tec 



<210> 97 

<211> 22 

<212> DNA 

<213> Rhodococcus erythropolis AN12 



<400> 97 

tcaccctcgc geggteggag tc 



<210> 98 

<211> 26 

<212> DNA 

<213> Rhodococcus erythropolis AN12 
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<400> 98 

gtgaagcttc ccgaacatgt cgaaac 

<210> 99 
<211> 25 
<212> DNA 

<213> Rhodococcus erythropolis AN12 
<400> 99 

tcatgcctgg acgctttcga tcttg 

<210> 100 

<211> 25 

<212> DNA" " • - 

<213> Rhodococcus erythropolis AN12 



<400> 100 

atgacacagc atgtcgacgt actga 



<210> 101 

<211> 24 

<212> DNA 

<213> Rhodococcus erythropolis AN12 



<400> 101 

ctatgcgctg gcgaccttgc tatc 



<210> 102 

<211> 25 

<212> DNA 

<213> Rhodococcus erythropolis AN12 



<400> 102 

atgtcatcac gggtcaacga cggcc 
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<210> 103 

<211> 24 

<212> DNA 

<213> Rhodococcus erythropolis AN12 

<400> 103 

tcatcctttg cctgtcgtca gtgc 

<210> 104 

<211> 24 

<212> DNA 

<213> Rhodococcus erythropolis AN12 

<400> 104 

atgactacac aaaaggccct gacc 

<210> 105 

<211> 22 

<212> DNA 

<213> Rhodococcus erythropolis AN12 



<400> 105 

tcaggcgtcg acggtgtcgg cc 



<210> 106 

<211> 25 

<212> DNA 

<213> Rhodococcus erythropolis AN12 



<400> 106 

atgacaacta ccgaatccag aactc 



<210> 107 
<211> 26 
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<212> DNA 

<213> Rhodococcus erythropolis AN12 



<400> 107 

tcagcgcaga ttgaagccct tgtatc 



<210> 108 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer A102FI for screening Arthrobacter sp. BP2 library 

<400> 108 

gcacacctac atcacccagc 

<210> 109 

<211> 17 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer CONR for screening Arthrobacter sp. BP2 library 

<400> 109 
ccgcccaggt agaacag 

<210> 110 

<211> 24 

<212> DNA 

<213> Artificial 



<220> 

<223> Primer A228FI for screening Rhodococcus sp. phi2 library 
<400> 110 
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^► v %gatctcgat ccggcggtag ttgc 24 

<210> 111 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer A228RI for screening Rhodococcus sp. phi2 library 

<400> 111 

gctgatgccg accggtctgt acg 23 

<210> 112 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer A2PI for screening Rhodococcus sp. phil library 

<400> 112 

ccacagttgt cgacgccgtt gtc 23 

<210> 113 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer A34RI for screening Rhodococcus sp. phil library 

<400> 113 

tcgaaacctc ggtagctgtc gg 22 
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